GoogleCloudDocumentaiV1Document

GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1Document


Description

Document represents the canonical document resource in Document AI. It is an interchange format that provides insights into documents and allows for collaboration between users and Document AI to iterate and optimize for quality.

Attributes List

This module has the following attributes, listed in case-insensitive ascending alphabetical order:

Attributes

  1. content (type: String.t, default: nil)
    - Optional. Inline document content, represented as a stream of bytes. Note: As with all bytes fields, Protocol Buffers use a pure binary representation, whereas JSON representations use base64.
  2. entities (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentEntity), default: nil)
    - A list of entities detected on Document.text. For document shards, entities in this list may cross shard boundaries.
  3. entityRelations (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentEntityRelation), default: nil)
    - Placeholder. Relationship among Document.entities.
  4. error (type: GoogleApi.ContentWarehouse.V1.Model.GoogleRpcStatus, default: nil)
    - Any error that occurred while processing this document.
  5. mimeType (type: String.t, default: nil)
    - An IANA published media type (MIME type).
  6. pages (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPage), default: nil)
    - Visual page layout for the Document.
  7. revisions (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentRevision), default: nil)
    - Placeholder. Revision history of this document.
  8. shardInfo (type: GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentShardInfo, default: nil)
    - Information about the sharding if this document is a shard of a larger document. If the document is not sharded, this message is not specified.
  9. text (type: String.t, default: nil)
    - Optional. UTF-8 encoded text in reading order from the document.
  10. textChanges (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentTextChange), default: nil)
    - Placeholder. A list of text corrections made to Document.text. This is usually used for annotating corrections to OCR mistakes. Text changes for a given revision may not overlap with each other.
  11. textStyles (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentStyle), default: nil)
    - Styles for the Document.text.
  12. uri (type: String.t, default: nil)
    - Optional. Currently supports a Google Cloud Storage URI of the form gs://bucket_name/object_name. Object versioning is not supported. For more information, refer to Google Cloud Storage Request URIs.
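
The content and uri attributes above each carry an encoding convention: content is base64 when the document travels as JSON, and uri follows the gs://bucket_name/object_name form. The following is a minimal sketch of handling both, assuming the document has already been parsed from JSON into a plain map with string keys. The DocumentSketch module and its function names are hypothetical and are not part of the GoogleApi.ContentWarehouse library; only Elixir standard library calls are used.

```elixir
defmodule DocumentSketch do
  # Hypothetical helper module for illustration; not part of the library.

  # Decodes the base64 `content` value of a JSON-parsed document map.
  # Returns {:ok, bytes}, {:ok, nil} when no content is set,
  # or :error when the value is not valid base64.
  def decode_content(%{"content" => encoded}) when is_binary(encoded) do
    Base.decode64(encoded)
  end

  def decode_content(_doc), do: {:ok, nil}

  # Splits a gs://bucket_name/object_name URI into its bucket and object.
  # Returns {:ok, {bucket, object}} or :error for any other shape.
  def parse_gcs_uri("gs://" <> rest) do
    case String.split(rest, "/", parts: 2) do
      [bucket, object] when bucket != "" and object != "" ->
        {:ok, {bucket, object}}

      _ ->
        :error
    end
  end

  def parse_gcs_uri(_other), do: :error
end

# Example usage with a hand-built map standing in for a parsed JSON document.
doc = %{
  "content" => Base.encode64("hello"),
  "mimeType" => "text/plain",
  "uri" => "gs://bucket_name/path/to/object.txt"
}

{:ok, bytes} = DocumentSketch.decode_content(doc)
{:ok, {bucket, object}} = DocumentSketch.parse_gcs_uri(doc["uri"])
IO.inspect({bytes, bucket, object})
```

Splitting with parts: 2 keeps any further slashes inside the object name, so an object such as path/to/object.txt stays in one piece.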

Type

Function

@spec decode(struct(), keyword()) :: struct()

Data sourced from HexDocs: GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1Document