GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchSourceUrlDocjoinInfo
Table of Contents ▼
Jump to a specific part of the page:
Description
The proto containing all the information we extracted from docjoin, for the source_url of the dataset. NEXT TAG: 18
Attributes List
This module has the following attributes (case-insensitive ascending order):
Attributes
-
dataSource
(type:String.t
, default:nil
)
- -
displayUrl
(type:String.t
, default:nil
)
- The url used to display in the google search results. -
docid
(type:String.t
, default:nil
)
- The docid of the document. -
indexTier
(type:list(String.t)
, default:nil
)
- Index tiers (BASE, UNIFIED_ZEPPELIN, etc) that the document belongs to. NOTE: Each document may belong to multiple tiers. NOTE: The original data type is an enum CompositeDoc::SubIndexType. However we don't want to depend on segindexer/compositedoc.proto because the proto is too large. Instead, we use CompositeDoc::SubIndexType_Name( subindexid) to convert into a string representation. To convert string back to CompositeDoc::SubIndexType, use CompositeDoc::SubIndexType_Parse. -
languageCode
(type:String.t
, default:nil
)
- The language of the document in the string representation of LanguageCode. Converts from Language Enum to LanguageCode through i18n/identifiers/langenclanguagecodeconverter.h Please use i18n/identifiers/languagecodeconverter.h for converting between LanguageCode and string representation. -
latestPageUpdateDate
(type:String.t
, default:nil
)
- The syntactic date of a dataset document that reflects the publication date of the content. -
pagerank
(type:integer()
, default:nil
)
- The page rank of the document. DEPRECATED in favour of Pagerank_NS. Do not use as it is no longer maintained in docjoins and can break at any moment. -
pagerankNs
(type:integer()
, default:nil
)
- The production pagerank value of the document. -
petacatInfo
(type:GoogleApi.ContentWarehouse.V1.Model.FatcatCompactDocClassification
, default:nil
)
- Petacat classifications for the web document. Normally the results from calling Petacat come in a PetacatResponse, which is very flexible and extensible. This proto takes most of the flexibility away - only rephil clusters, taxonomic classifications, and binary classifications, with discretized weights. -
salientTerms
(type:GoogleApi.ContentWarehouse.V1.Model.QualitySalientTermsSalientTermSet
, default:nil
)
- A set of salient terms extracted fromthe document. DEPRECATEAD. Moved to DatasetMetadata for performance reasons. -
scholarInfo
(type:GoogleApi.ContentWarehouse.V1.Model.ScienceIndexSignal
, default:nil
)
- Science per-doc data for inclusion in websearch. -
sporeGraphMid
(type:list(String.t)
, default:nil
)
- A set of entities from WebRef annotations that are in SPORE_GRAPH. -
title
(type:String.t
, default:nil
)
- The title of the document. -
topEntity
(type:list(GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefWebrefEntity)
, default:nil
)
- A set of top entities from WebrefAnnotation, top is defined by topicality score, see go/topicality-score for detail. DEPRECATED. See label_to_mids_map instead. -
url
(type:String.t
, default:nil
)
- The url of the document. -
webrefEntity
(type:list(GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchSourceUrlDocjoinInfoWebrefEntityInfo)
, default:nil
)
- A set of entities copied from WebRefEntities on cDoc.
Type
@type t() :: %GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchSourceUrlDocjoinInfo{
dataSource: String.t() | nil,
displayUrl: String.t() | nil,
docid: String.t() | nil,
indexTier: [String.t()] | nil,
languageCode: String.t() | nil,
latestPageUpdateDate: String.t() | nil,
navboostQuery: [ GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchNavboostQueryInfo.t() ] | nil,
pagerank: integer() | nil,
pagerankNs: integer() | nil,
petacatInfo: GoogleApi.ContentWarehouse.V1.Model.FatcatCompactDocClassification.t() | nil,
salientTerms: GoogleApi.ContentWarehouse.V1.Model.QualitySalientTermsSalientTermSet.t() | nil,
scholarInfo: GoogleApi.ContentWarehouse.V1.Model.ScienceIndexSignal.t() | nil,
sporeGraphMid: [String.t()] | nil,
title: String.t() | nil,
topEntity: [GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefWebrefEntity.t()] | nil,
url: String.t() | nil,
webrefEntity: [ GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchSourceUrlDocjoinInfoWebrefEntityInfo.t() ] | nil
}
dataSource: String.t() | nil,
displayUrl: String.t() | nil,
docid: String.t() | nil,
indexTier: [String.t()] | nil,
languageCode: String.t() | nil,
latestPageUpdateDate: String.t() | nil,
navboostQuery: [ GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchNavboostQueryInfo.t() ] | nil,
pagerank: integer() | nil,
pagerankNs: integer() | nil,
petacatInfo: GoogleApi.ContentWarehouse.V1.Model.FatcatCompactDocClassification.t() | nil,
salientTerms: GoogleApi.ContentWarehouse.V1.Model.QualitySalientTermsSalientTermSet.t() | nil,
scholarInfo: GoogleApi.ContentWarehouse.V1.Model.ScienceIndexSignal.t() | nil,
sporeGraphMid: [String.t()] | nil,
title: String.t() | nil,
topEntity: [GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefWebrefEntity.t()] | nil,
url: String.t() | nil,
webrefEntity: [ GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchSourceUrlDocjoinInfoWebrefEntityInfo.t() ] | nil
}
Function
@spec decode(struct(), keyword()) :: struct()Data sourced from HexDocs : GoogleApi.ContentWarehouse.V1.Model.ResearchScienceSearchSourceUrlDocjoinInfo