Document Loader properties

Important: Agents is currently in beta. For more information, see the Nintex Beta terms document.

Document loader properties define the settings used to import and process documents. Use these settings to specify supported file types, content extraction options, and document handling behavior before the document is prepared for use.

Edit Document Loader properties

  1. Go to Agents > Document Stores.

  2. Click the document store containing the document loader you want to edit.

  3. Click Options.

  4. Select Preview & Process.

    The document loader properties page is displayed.

  5. Update the required settings, then click Process.

Document loader types and properties

Best practices for handling line breaks in document stores

When processing documents in a document store, line breaks can affect how text is stored, split, and prepared for embeddings. Configure line break handling carefully so that content stays readable while the text remains suitable for AI models.

Clean text during upload when documents contain inconsistent formatting

Cleaning text during upload creates a consistent stored version of the document. This step removes unnecessary spacing and fixes formatting issues so that the stored content is clean and easier to use across platform features.

Use upload cleanup when documents originate from multiple sources, such as:

  • PDFs with inconsistent line breaks.

  • OCR-generated text with broken lines.

  • Emails with irregular spacing.

  • Content copied from websites.

A clean stored version improves readability and supports downstream processes such as previewing documents, building search indexes, and synchronizing or exporting content.
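The kind of cleanup described above can be sketched in a few lines of Python. This is an illustration only, not the product's implementation: the function name clean_for_storage and the specific rules (normalizing line endings, collapsing repeated spaces, limiting consecutive blank lines) are assumptions chosen to show what a consistent stored version might look like.

```python
import re


def clean_for_storage(text: str) -> str:
    """Hypothetical upload-time cleanup: tidy spacing but keep line structure."""
    # Use one newline convention (covers Windows and old Mac line endings).
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    # Collapse runs of spaces and tabs into a single space.
    text = re.sub(r"[ \t]+", " ", text)
    # Remove spaces left at the start or end of each line.
    text = "\n".join(line.strip() for line in text.split("\n"))
    # Reduce three or more consecutive newlines to a single blank line.
    text = re.sub(r"\n{3,}", "\n\n", text)
    return text.strip()


if __name__ == "__main__":
    raw = "Hello   world \r\n\r\n\r\n\r\nNext\tline  "
    print(repr(clean_for_storage(raw)))
```

Note that this sketch keeps single line breaks intact. It only removes spacing noise, so the stored text stays readable in previews.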

Prepare text for the AI model during embedding

Embedding preparation focuses on optimizing text for AI processing rather than storage. Removing unnecessary line breaks during this stage helps the model read the content consistently and improves chunking and retrieval.

Embedding cleanup may include:

  • Removing single line breaks while preserving paragraph structure.

  • Trimming unnecessary spacing.

  • Normalizing text formatting before generating embeddings.

Apply this step to ensure the text format aligns with model requirements.
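As an illustration of removing single line breaks while preserving paragraph structure, the following Python sketch treats blank lines as paragraph boundaries and joins the wrapped lines inside each paragraph. The function name prepare_for_embedding is hypothetical, and the rule that a blank line marks a paragraph is an assumption; the product's own cleanup options may behave differently.

```python
import re


def prepare_for_embedding(text: str) -> str:
    """Hypothetical embedding-time cleanup: join wrapped lines, keep paragraphs."""
    # Assumption: one or more blank lines separate paragraphs.
    paragraphs = re.split(r"\n\s*\n", text.strip())
    # Inside a paragraph, replace line breaks and extra spacing with one space.
    joined = [" ".join(paragraph.split()) for paragraph in paragraphs]
    # Rejoin the non-empty paragraphs with a single blank line between them.
    return "\n\n".join(paragraph for paragraph in joined if paragraph)


if __name__ == "__main__":
    sample = "The quick\nbrown fox.\n\nSecond  paragraph\ncontinues here."
    print(prepare_for_embedding(sample))
```

With the sample input, the two wrapped lines of each paragraph are joined, and the blank line between the paragraphs is kept.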

Clean up during both upload and embedding when documents are inconsistent

In some scenarios, cleaning text during both upload and embedding produces better results. This approach creates a clean stored document while still allowing model-specific formatting adjustments.

Use cleanup at both stages when:

  • Documents originate from multiple or inconsistent sources.

  • The document store must maintain clean content for previews or indexing.

  • Embedding rules may change over time.

  • Upload and embedding processes occur at different times.

Separating these steps keeps the stored content consistent while allowing embedding behavior to evolve.
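The separation can be pictured as two independent functions: one runs once at upload and produces the stored text, and the other runs at embedding time and can be swapped without touching what is stored. The sketch below is a simplified model under that assumption. StoredDocument, upload, embedding_input, and the two cleanup helpers are all hypothetical names, not part of the product.

```python
import re
from dataclasses import dataclass
from typing import Callable


def clean_for_storage(text: str) -> str:
    """Hypothetical upload rule: normalize line endings and spacing."""
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    lines = [re.sub(r"[ \t]+", " ", line).strip() for line in text.split("\n")]
    return re.sub(r"\n{3,}", "\n\n", "\n".join(lines)).strip()


def join_wrapped_lines(text: str) -> str:
    """Hypothetical embedding rule: drop single line breaks, keep paragraphs."""
    paragraphs = re.split(r"\n{2,}", text)
    return "\n\n".join(" ".join(p.split()) for p in paragraphs if p.strip())


@dataclass(frozen=True)
class StoredDocument:
    text: str  # cleaned once at upload and never modified afterwards


def upload(raw: str) -> StoredDocument:
    return StoredDocument(text=clean_for_storage(raw))


def embedding_input(
    doc: StoredDocument, rule: Callable[[str], str] = join_wrapped_lines
) -> str:
    # The rule is applied to a copy of the text; the stored document is untouched.
    return rule(doc.text)


if __name__ == "__main__":
    doc = upload("Line one  \r\nline two\r\n\r\n\r\n\r\nPara two")
    print(repr(doc.text))
    print(repr(embedding_input(doc)))
```

Because the embedding rule is passed in as a parameter, a later change to embedding behavior only requires regenerating embeddings from the stored text, not re-uploading the documents.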

Preserve line breaks when formatting is meaningful

Some documents rely on line breaks for structure. Removing them may change the meaning or reduce clarity.

Preserve line breaks for documents such as:

  • Source code or technical snippets.

  • Legal or policy documents.

  • Structured content where spacing conveys meaning.

Review document formatting before enabling line break cleanup to ensure important structure is preserved.
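A manual review can be supported by a simple pre-check that flags documents whose line breaks are likely to be meaningful. The Python sketch below is a rough heuristic and not a product feature: the function name, the extension list, and the 30 percent indentation threshold are all assumptions, and a flagged or unflagged result should still be confirmed by looking at the document.

```python
from pathlib import PurePath

# Assumption: file types where line breaks usually carry structure.
PRESERVE_EXTENSIONS = {".py", ".js", ".java", ".sql", ".yaml", ".yml", ".csv"}


def should_preserve_line_breaks(
    filename: str, text: str, indent_ratio: float = 0.3
) -> bool:
    """Hypothetical heuristic: True when line breaks likely convey structure."""
    # Source code and structured data files keep their line breaks.
    if PurePath(filename).suffix.lower() in PRESERVE_EXTENSIONS:
        return True
    # Otherwise, look at how many non-empty lines start with indentation,
    # which is common in code snippets and in nested legal clauses.
    lines = [line for line in text.splitlines() if line.strip()]
    if not lines:
        return False
    indented = sum(1 for line in lines if line.startswith((" ", "\t")))
    return indented / len(lines) >= indent_ratio


if __name__ == "__main__":
    clause = "1. Scope\n    (a) First clause\n    (b) Second clause"
    print(should_preserve_line_breaks("policy.txt", clause))
    print(should_preserve_line_breaks("notes.txt", "A plain\nparagraph of text."))
```

In this sketch the indented clauses in the first sample cause it to be flagged, while the plain wrapped paragraph in the second sample is not.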