ResearchScienceSearchSourceUrlDocjoinInfo

GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchSourceUrlDocjoinInfo


Table of Contents ▼

Jump to a specific part of the page:

Description

The proto containing all the information we extracted from docjoin, for the source_url of the dataset. NEXT TAG: 18

Attributes List

This module has the following attributes (case-insensitive ascending order):

View Attributes

Attributes

  1. dataSource (type: String.t, default: nil)
    -
  2. displayUrl (type: String.t, default: nil)
    - The url used to display in the google search results.
  3. docid (type: String.t, default: nil)
    - The docid of the document.
  4. indexTier (type: list(String.t), default: nil)
    - Index tiers (BASE, UNIFIED_ZEPPELIN, etc) that the document belongs to. NOTE: Each document may belong to multiple tiers. NOTE: The original data type is an enum CompositeDoc::SubIndexType. However we don't want to depend on segindexer/compositedoc.proto because the proto is too large. Instead, we use CompositeDoc::SubIndexType_Name( subindexid) to convert into a string representation. To convert string back to CompositeDoc::SubIndexType, use CompositeDoc::SubIndexType_Parse.
  5. languageCode (type: String.t, default: nil)
    - The language of the document in the string representation of LanguageCode. Converts from Language Enum to LanguageCode through i18n/identifiers/langenclanguagecodeconverter.h Please use i18n/identifiers/languagecodeconverter.h for converting between LanguageCode and string representation.
  6. latestPageUpdateDate (type: String.t, default: nil)
    - The syntactic date of a dataset document that reflects the publication date of the content.
  7. navboostQuery (type: list(GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchNavboostQueryInfo), default: nil)
    - A sequence of Navboost queries for the dataset source_url.
  8. pagerank (type: integer(), default: nil)
    - The page rank of the document. DEPRECATED in favour of Pagerank_NS. Do not use as it is no longer maintained in docjoins and can break at any moment.
  9. pagerankNs (type: integer(), default: nil)
    - The production pagerank value of the document.
  10. petacatInfo (type: GoogleApi.ContentWarehouse.V1.Model.FatcatCompactDocClassification, default: nil)
    - Petacat classifications for the web document. Normally the results from calling Petacat come in a PetacatResponse, which is very flexible and extensible. This proto takes most of the flexibility away - only rephil clusters, taxonomic classifications, and binary classifications, with discretized weights.
  11. salientTerms (type: GoogleApi.ContentWarehouse.V1.Model.QualitySalientTermsSalientTermSet, default: nil)
    - A set of salient terms extracted fromthe document. DEPRECATEAD. Moved to DatasetMetadata for performance reasons.
  12. scholarInfo (type: GoogleApi.ContentWarehouse.V1.Model.ScienceIndexSignal, default: nil)
    - Science per-doc data for inclusion in websearch.
  13. sporeGraphMid (type: list(String.t), default: nil)
    - A set of entities from WebRef annotations that are in SPORE_GRAPH.
  14. title (type: String.t, default: nil)
    - The title of the document.
  15. topEntity (type: list(GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefWebrefEntity), default: nil)
    - A set of top entities from WebrefAnnotation, top is defined by topicality score, see go/topicality-score for detail. DEPRECATED. See label_to_mids_map instead.
  16. url (type: String.t, default: nil)
    - The url of the document.
  17. webrefEntity (type: list(GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchSourceUrlDocjoinInfoWebrefEntityInfo), default: nil)
    - A set of entities copied from WebRefEntities on cDoc.

Type

@type t() :: %GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchSourceUrlDocjoinInfo{
dataSource: String.t() | nil,
displayUrl: String.t() | nil,
docid: String.t() | nil,
indexTier: [String.t()] | nil,
languageCode: String.t() | nil,
latestPageUpdateDate: String.t() | nil,
navboostQuery: [ GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchNavboostQueryInfo.t() ] | nil,
pagerank: integer() | nil,
pagerankNs: integer() | nil,
petacatInfo: GoogleApi.ContentWarehouse.V1.Model.FatcatCompactDocClassification.t() | nil,
salientTerms: GoogleApi.ContentWarehouse.V1.Model.QualitySalientTermsSalientTermSet.t() | nil,
scholarInfo: GoogleApi.ContentWarehouse.V1.Model.ScienceIndexSignal.t() | nil,
sporeGraphMid: [String.t()] | nil,
title: String.t() | nil,
topEntity: [GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefWebrefEntity.t()] | nil,
url: String.t() | nil,
webrefEntity: [ GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchSourceUrlDocjoinInfoWebrefEntityInfo.t() ] | nil
}

Function

@spec decode(struct(), keyword()) :: struct()

Data sourced from HexDocs : GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchSourceUrlDocjoinInfo