Built-in processors let you enrich or validate data without maintaining custom code. Combine them to prepare content for semantic search, normalize addresses, or orchestrate human review.Documentation Index
Fetch the complete documentation index at: https://docs.clinia.com/llms.txt
Use this file to discover all available pages before exploring further.
Vectorizer
Transforms text into numerical embeddings so you can run hybrid search. Configuration:inputProperty— Path to the text field or the output of a previous processor.modelId— Embedding model to use (for examplemte-base-v1ormte-base-knowledge-v1for dense vectors, ormte-base-sparse-v1for sparse embeddings).propertyKey— Destination sub-property that stores the resulting vector.
Optical Character Recognition
Currently available on object collections. The processor reads the uploaded file attached to the object.
markdown field alongside the original binary.
Actionable
Design actionable steps for scenarios where automated processors cannot make the final decision.Schema Validator
Adds an explicit validation checkpoint mid-pipeline. This reuses the rules defined in your profiles and complements the implicit validation that occurs at the end. Use it to:- Stop the pipeline before an expensive processor when data is incomplete.
- Re-validate after a mutation step to ensure enriched data stays compliant.
Mutating vs. enriching processors
- Mutating processors (Clinia Function, OCR) replace the input property with enriched data. Update your schema first so the new shape passes validation.
- Enriching processors (Vectorizer) add derived properties under
enrichedProperties. They keep the original field intact while making extra data available to partitions.