Collectors

A collector gathers and organizes the large amounts of data needed to train or operate AI models, such as large language models (LLMs), which depend on high-quality datasets.

To add a collector:
After signing in, the Dashboard page is displayed.

  1. Click Collectors in the navigation pane.
  2. Click New Collector.
  3. Enter the following details in the General tab:
    Name – A name to identify the collector.
    URI – The URI to access the collector.
    Default Collector – Turn on the toggle to make this collector the default collector.
    Collector Status – Select whether the collector is Enabled or Disabled.
  4. Click Connect in the URI field to connect to the collector and retrieve file type mappings.
  5. Click the File Processing tab.
  6. For each file type, select a different Reader if you do not want to use the default.
    Note

    The available readers depend on the file type.

  7. Click Save Changes.
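
The values entered in the procedure above can be pictured as a small record. The following Python sketch is purely illustrative and is not a CTERA API: the class name `CollectorSettings`, its field names, the `problems` helper, the example URI, and the reader name are all hypothetical. It only mirrors the General tab fields and the per-file-type Reader choices, and shows the kind of basic checks worth making before you click Connect.

```python
from dataclasses import dataclass, field
from urllib.parse import urlparse


@dataclass
class CollectorSettings:
    """Hypothetical model of the values entered in the two tabs."""

    # General tab
    name: str
    uri: str
    default_collector: bool = False
    enabled: bool = True
    # File Processing tab: file type -> Reader chosen instead of the default.
    reader_overrides: dict[str, str] = field(default_factory=dict)

    def problems(self) -> list[str]:
        """Return a list of obvious mistakes in the General tab values."""
        issues = []
        if not self.name.strip():
            issues.append("Name is empty")
        parsed = urlparse(self.uri)
        if not parsed.scheme or not parsed.netloc:
            issues.append("URI is missing a scheme or host")
        return issues


# Example values only; the URI and reader name are placeholders.
settings = CollectorSettings(
    name="docs-collector",
    uri="https://collector.example.com",
    reader_overrides={"pdf": "example-pdf-reader"},
)
print(settings.problems())
```

Only file types whose Reader differs from the default appear in `reader_overrides`, which matches step 6: leave a file type untouched to keep its default Reader.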


To edit a collector:

  1. Click Collectors in the navigation pane.
  2. Click the collector to edit.
  3. Make the required changes in the General and File Processing tabs.
  4. Click Save Changes.