Dataset#

Dataset view

The Dataset page is the main page of the Rubrix web app. From here you can access most of Rubrix’s features, like exploring and annotating the records of your dataset.

The page is composed of 4 major components:

Filters#

Dataset filters

The filters provide you a quick and intuitive way to filter and sort your records with respect to various parameters. You can find more information about how to use the filters in our detailed filter guide.

Record cards#

The record cards are at the heart of the Dataset page and contain your data. There are three different flavors of record cards depending on the task of your dataset. All of them share the same basic structure showing the input text and a vertical ellipsis (or “kebab menu”) on the top right that lets you access the record’s metadata. Predictions and annotations are shown depending on the current mode and task of the dataset.

Check out our exploration and annotation guides to see how the record cards work in the different modes.

Text classification#

Text classification view

In this task the predictions are given as tags below the input text. They contain the label as well as a percentage score. When in Explore mode annotations are shown as tags on the right together with a symbol indicating if the predictions match the annotations or not. When in Annotate mode predictions and annotations share the same labels (annotation labels are darker).

A text classification dataset can support either single-label or multi-label classification - in other words, records are either annotated with one single label or various.

Token classification#

Token classification view

In this task annotations are given as colored highlights in the input text, while predictions are indicated by underlines. At the top of the record list you will find a legend that connects the colors to the respective labels. When in Annotate mode you can remove annotations or add new ones by simply selecting the desired text.

Hint

When using the score filter, the predictions that do not fall in the selected range will be missing the solid thin line.

Text2Text#

Text2Text view

In this task predictions and the annotation are given in a text field below the input text. You can switch between prediction and annotation via the “View annotation”/”View predictions” buttons. For the predictions you can find an associated score in the lower left corner. If you have multiple predictions you can toggle between them using the arrows on the button of the record card.