GoogleCloudContentwarehouseV1GcsIngestWithDocAiProcessorsPipeline

GoogleApi.ContentWarehouse.V1.Model.GoogleCloudContentwarehouseV1GcsIngestWithDocAiProcessorsPipeline

Table of Contents ▼

Jump to a specific part of the page:

Description
Attribute List
Attributes
Type
Function
Related links
Modules linked to
Possibly related Modules

Description

The configuration of the Cloud Storage Ingestion with DocAI Processors pipeline.

Attributes List

This module has the following attributes (case-insensitive ascending order):

View Attributes

extractProcessorInfos
inputPath
pipelineConfig
processorResultsFolderPath
skipIngestedDocuments
splitClassifyProcessorInfo

Attributes

extractProcessorInfos (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudContentwarehouseV1ProcessorInfo), default: nil)
- The extract processors information. One matched extract processor will be used to process documents based on the classify processor result. If no classify processor is specified, the first extract processor will be used.
inputPath (type: String.t, default: nil)
- The input Cloud Storage folder. All files under this folder will be imported to Document Warehouse. Format: gs:///.
pipelineConfig (type: GoogleApi.ContentWarehouse.V1.Model.GoogleCloudContentwarehouseV1IngestPipelineConfig, default: nil)
- Optional. The config for the Cloud Storage Ingestion with DocAI Processors pipeline. It provides additional customization options to run the pipeline and can be skipped if it is not applicable.
processorResultsFolderPath (type: String.t, default: nil)
- The Cloud Storage folder path used to store the raw results from processors. Format: gs:///.
skipIngestedDocuments (type: boolean(), default: nil)
- The flag whether to skip ingested documents. If it is set to true, documents in Cloud Storage contains key "status" with value "status=ingested" in custom metadata will be skipped to ingest.
splitClassifyProcessorInfo (type: GoogleApi.ContentWarehouse.V1.Model.GoogleCloudContentwarehouseV1ProcessorInfo, default: nil)
- The split and classify processor information. The split and classify result will be used to find a matched extract processor.

Type

@type t() :: %GoogleApi.ContentWarehouse.V1.Model.GoogleCloudContentwarehouseV1GcsIngestWithDocAiProcessorsPipeline{
extractProcessorInfos: [ GoogleApi.ContentWarehouse.V1.Model.GoogleCloudContentwarehouseV1ProcessorInfo.t() ] | nil,
inputPath: String.t() | nil,
pipelineConfig: GoogleApi.ContentWarehouse.V1.Model.GoogleCloudContentwarehouseV1IngestPipelineConfig.t() | nil,
processorResultsFolderPath: String.t() | nil,
skipIngestedDocuments: boolean() | nil,
splitClassifyProcessorInfo: GoogleApi.ContentWarehouse.V1.Model.GoogleCloudContentwarehouseV1ProcessorInfo.t() | nil
}

Function

@spec decode(struct(), keyword()) :: struct()

Data sourced from HexDocs : GoogleApi.ContentWarehouse.V1.Model.GoogleCloudContentwarehouseV1GcsIngestWithDocAiProcessorsPipeline