Google Books Metadata Requirements

This document provides Google Books' requirements for metadata from new data providers, including required and helpful bibliographic, author, and series fields. Key requirements are title, author name(s), contributor roles, and universal identifiers. Data can be submitted in MARC or ONIX format and must represent individual books, series, or authors with examples of each provided.


Google Books Metadata Requirements for New Data Providers
Author: ​[email protected] 

Last Updated: 8 January 2021


 

Background 

Google Books and the corresponding Books vertical on Google Search cover all
types of bibliographic information and structured data. Our goal is to provide a
comprehensive index of every book ever published. To achieve this goal, we
license and ingest book metadata from providers around the world.

Data Needs 

1. Bibliographic 
❏ Information about Books 
❏ Any form that books take (print hardcover, print paperback, ebook, audiobook, etc.) 
❏ Book subjects, genre, or categorical information 

2. Author / Entity 
❏ Information about People who have contributed to books 
❏ Information about the person (name, birth date, etc.) 
❏ Books this person has written or contributed to in some fashion 

3. Series 
❏ Information about books that are published in a sequence or recurring fashion 
❏ Includes literary series (example), publisher collections (example), comic collections, and manga
❏ List of books that occur in the series & order of the books including numbering 

 
 

Key Field Requirements 

Required Fields 
● Title / Subtitle 
● Names of Author(s) or other contributors  
● Roles of contributors such as editor, illustrator, photographer, etc. 
● Universal Identifiers such as ISBN, OCLC, LCCN, ELAN, etc. (required if applicable) 
● Publisher 
● Publication date 
● Language of text 
● Type/format of book (physical book, audiobook, ebook) 
● Timestamp 
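
A provider could pre-check records against the required list above before delivery. The following is a minimal sketch of such a check, not a Google-supplied validator. The dictionary keys (`title`, `contributors`, `identifiers`, and so on) and the function name `missing_required_fields` are assumptions chosen for illustration.

```python
from typing import Any

# Hypothetical internal keys, one per required field in the list above.
REQUIRED_KEYS = (
    "title",
    "contributors",      # each entry needs a name and a role
    "publisher",
    "publication_date",
    "language",
    "format",
    "timestamp",
)


def missing_required_fields(record: dict[str, Any],
                            identifier_applicable: bool = True) -> list[str]:
    """Return the names of required fields that are absent or empty."""
    missing = [key for key in REQUIRED_KEYS if not record.get(key)]
    # Every contributor needs both a name and a role (author, editor, ...).
    for i, person in enumerate(record.get("contributors") or []):
        for part in ("name", "role"):
            if not person.get(part):
                missing.append(f"contributors[{i}].{part}")
    # Identifiers are "required if applicable", so the caller decides
    # whether the book is expected to have one.
    if identifier_applicable and not record.get("identifiers"):
        missing.append("identifiers")
    return missing


record = {
    "title": "Charlotte's web",
    "contributors": [
        {"name": "White, E. B.", "role": "author"},
        {"name": "Williams, Garth", "role": ""},
    ],
    "publisher": "Puffin",
    "publication_date": "2014",
    "language": "eng",
    "format": "physical book",
    "identifiers": {"isbn": ["9780141354828"]},
}
print(missing_required_fields(record))
# ['timestamp', 'contributors[1].role']
```

The check reports problems instead of raising, so a provider can log every incomplete record in one pass.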

Helpful Fields 
● Book summary / description (Highly recommended) 
● Pagination (page count) 
● Subject of book 
● Genre of book 
● Intended audience (appropriate age or age range) 
● Series name and number 

Data Structure 
Book data pipelines are set up to handle canonical “records” that each describe one conceptual
book edition. A book edition is a version of a written work that has been published and made
accessible to readers. More information about Works and Editions can be found on the developer
website. A single record must represent one book. In some cases that book is split into
multiple volumes (example) or is a physical book that contains multiple works (an omnibus,
example).

 
 

If providing Series data, each record must represent a single series with a single name. If the name 
changes over time, a new record should be created for each new name. If the publisher changes, but 
the name stays the same, the same record can be used.  

If providing author information, each record should represent a single person or entity. Alternate 
names should be listed on the main record. Pen names can be listed on the main record, or, if 
significant to the author’s identity, a new record can be created for the pen name.  
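
The series and author rules above can be modeled with two small record types. This is only a sketch under stated assumptions: the class names `SeriesRecord` and `AuthorRecord`, the helper `series_record_for`, and the sample series and publisher names are all hypothetical. None of them come from Google's pipeline.

```python
from dataclasses import dataclass, field


@dataclass
class SeriesRecord:
    name: str                                   # exactly one name per record
    publishers: list[str] = field(default_factory=list)


@dataclass
class AuthorRecord:
    name: str                                   # one person or entity
    alternate_names: list[str] = field(default_factory=list)


def series_record_for(index: dict[str, SeriesRecord], name: str,
                      publisher: str) -> SeriesRecord:
    """Reuse the record when only the publisher changes.

    A new series name always gets a new record.
    """
    record = index.setdefault(name, SeriesRecord(name))
    if publisher not in record.publishers:
        record.publishers.append(publisher)
    return record


index: dict[str, SeriesRecord] = {}
first = series_record_for(index, "Example Classics", "Publisher A")
same = series_record_for(index, "Example Classics", "Publisher B")
renamed = series_record_for(index, "Example Modern Classics", "Publisher B")

# Alternate names and pen names live on the main author record by default.
author = AuthorRecord("Jane Smith", alternate_names=["J. Smith"])
```

Keying the index on the series name makes the "new name, new record" rule automatic, while a publisher change only extends the existing record.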

This is a very brief overview. Please reach out with additional questions. 

Data Format 
Book data can be ingested in one of two formats: 

● MARC
○ Library of Congress guidelines: Record Structure: MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media
○ Info from Itsmarc.com: A General Introduction to the MARC Format
○ Guidance from OCLC: Bibliographic Formats and Standards
● ONIX
○ ONIX for Books official site: https://www.editeur.org/83/Overview/
○ ONIX 3.0 for Google Play Books: ONIX 3.0 for Google Play Books - Google Play Books Partner Center Help
○ ONIX 2.1 for Google Play Books: ONIX 2.1 for Google Play Books - Google Play Books Partner Center Help
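
When a delivery mixes both formats, the root element of each XML file is enough to tell them apart. The sketch below assumes XML serializations only (ONIX 3.0 in the reference-tag namespace, and MARCXML). It does not handle binary MARC (ISO 2709) or ONIX 2.1, and the function name `detect_format` is an illustrative choice.

```python
import xml.etree.ElementTree as ET

# Namespaces as they appear in the ONIX 3.0 and MARCXML examples in this document.
ONIX3_NS = "http://ns.editeur.org/onix/3.0/reference"
MARCXML_NS = "http://www.loc.gov/MARC21/slim"


def detect_format(xml_text: str) -> str:
    """Classify an XML document by its namespaced root element."""
    root = ET.fromstring(xml_text)
    if root.tag == f"{{{ONIX3_NS}}}ONIXMessage":
        return "ONIX 3.0"
    # A MARCXML file may hold one <record> or a <collection> of records.
    if root.tag in (f"{{{MARCXML_NS}}}record", f"{{{MARCXML_NS}}}collection"):
        return "MARCXML"
    return "unknown"


print(detect_format('<record xmlns="http://www.loc.gov/MARC21/slim"/>'))
# MARCXML
```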

 
Examples 
 
ONIX 3.0 Example  
<?xml version="1.0"?>
<ONIXMessage xmlns="http://ns.editeur.org/onix/3.0/reference"
release="3.0">

<Header>
<Sender>
<SenderName>My Publishing Company or Client Services Provider</SenderName>
<ContactName>Alice Smith</ContactName>
<EmailAddress>[email protected]</EmailAddress>
</Sender>
<Addressee>
<AddresseeName>Google</AddresseeName>
</Addressee>
<SentDateTime>20150723T142048Z</SentDateTime>
</Header>

<Product>

<RecordReference>myid.9789999999991</RecordReference>
<NotificationType>04</NotificationType>

<!-- The record must have at least one valid ISBN, when possible,
type = 15 recommended -->
<ProductIdentifier>
<ProductIDType>15</ProductIDType>
<IDValue>9789999999991</IDValue>
</ProductIdentifier>

<DescriptiveDetail>
<ProductComposition>00</ProductComposition>
<ProductForm>ED</ProductForm>
<ProductFormDetail>E101</ProductFormDetail>
<PrimaryContentType>10</PrimaryContentType>
<EpubTechnicalProtection>02</EpubTechnicalProtection>
<NoCollection />
<TitleDetail>
<TitleType>01</TitleType>
<TitleElement>
<TitleElementLevel>01</TitleElementLevel>
<TitleText>This Is My Distinctive Title</TitleText>
<Subtitle>This Is A Subtitle</Subtitle>
</TitleElement>
</TitleDetail>
<Contributor>
<SequenceNumber>1</SequenceNumber>
<ContributorRole>A01</ContributorRole>
<PersonName>Jane Smith</PersonName>
</Contributor>
<EditionType>DGO</EditionType>
<Language>
<LanguageRole>01</LanguageRole>
<LanguageCode>eng</LanguageCode>
</Language>
<Extent>
<ExtentType>10</ExtentType>
<ExtentValue>1234</ExtentValue>
<ExtentUnit>03</ExtentUnit>
</Extent>
<Subject>
<MainSubject/>
<!-- BISAC classification -->
<SubjectSchemeIdentifier>10</SubjectSchemeIdentifier>
<SubjectCode>LIT014000</SubjectCode>
</Subject>
<Subject>
<!-- BISAC classification -->
<SubjectSchemeIdentifier>10</SubjectSchemeIdentifier>
<SubjectCode>LIT024060</SubjectCode>
</Subject>
</DescriptiveDetail>

<CollateralDetail>
<TextContent>
<TextType>02</TextType>
<ContentAudience>00</ContentAudience>
<Text>I am a short description of the book.</Text>
</TextContent>
<TextContent>
<TextType>03</TextType>
<ContentAudience>00</ContentAudience>
<Text>I am a description of the book.</Text>
</TextContent>
<TextContent>
<TextType>12</TextType>
<ContentAudience>00</ContentAudience>
<Text>I am a biographical note of a book not a specific
author.</Text>
</TextContent>
</CollateralDetail>

<PublishingDetail>
<Imprint>
<ImprintName>My Publishing Imprint</ImprintName>
</Imprint>
<Publisher>
<PublishingRole>01</PublishingRole>
<PublisherName>My Publisher Name</PublisherName>
</Publisher>
<PublishingStatus>04</PublishingStatus>
<PublishingDate>
<PublishingDateRole>01</PublishingDateRole>
<DateFormat>00</DateFormat>
<Date>20120927</Date> <!-- YYYY and YYYYMM also valid -->
</PublishingDate>
<SalesRights>
<SalesRightsType>01</SalesRightsType>
<Territory>
<RegionsIncluded>WORLD</RegionsIncluded>
</Territory>
</SalesRights>
</PublishingDetail>

<!-- Related material not required if book is digital-only. -->


<RelatedMaterial/>

<ProductSupply>
<SupplyDetail>
<Supplier>
<SupplierRole>03</SupplierRole>
<SupplierName>My Supplier Name</SupplierName>
</Supplier>
<ProductAvailability>20</ProductAvailability>
<Price>
<PriceType>02</PriceType> <!-- This should match the price
type that you are configured to sell in the Partner Center -->
<PriceAmount>9.99</PriceAmount>
<CurrencyCode>USD</CurrencyCode>
<Territory>
<CountriesIncluded>US</CountriesIncluded>
</Territory>
</Price>
</SupplyDetail>
</ProductSupply>
</Product>
</ONIXMessage>
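
To sanity-check an ONIX 3.0 file before sending it, a provider might pull the key required fields out of each `<Product>` and verify the ISBN-13 check digit. The sketch below uses Python's standard `xml.etree.ElementTree` on a trimmed copy of the example above. The function names `isbn13_is_valid` and `summarize` are assumptions for illustration, and this is not how Google ingests ONIX.

```python
import xml.etree.ElementTree as ET

NS = {"onix": "http://ns.editeur.org/onix/3.0/reference"}

# Trimmed copy of the ONIX 3.0 example, kept inline so the sketch is self-contained.
SAMPLE = """<ONIXMessage xmlns="http://ns.editeur.org/onix/3.0/reference" release="3.0">
  <Product>
    <RecordReference>myid.9789999999991</RecordReference>
    <ProductIdentifier>
      <ProductIDType>15</ProductIDType>
      <IDValue>9789999999991</IDValue>
    </ProductIdentifier>
    <DescriptiveDetail>
      <TitleDetail>
        <TitleType>01</TitleType>
        <TitleElement>
          <TitleElementLevel>01</TitleElementLevel>
          <TitleText>This Is My Distinctive Title</TitleText>
        </TitleElement>
      </TitleDetail>
      <Contributor>
        <SequenceNumber>1</SequenceNumber>
        <ContributorRole>A01</ContributorRole>
        <PersonName>Jane Smith</PersonName>
      </Contributor>
    </DescriptiveDetail>
  </Product>
</ONIXMessage>"""


def isbn13_is_valid(isbn: str) -> bool:
    """Check the ISBN-13 check digit (weights alternate 1, 3, 1, 3, ...)."""
    if len(isbn) != 13 or not (isbn.isascii() and isbn.isdigit()):
        return False
    total = sum(int(d) * (1 if i % 2 == 0 else 3) for i, d in enumerate(isbn))
    return total % 10 == 0


def summarize(product: ET.Element) -> dict:
    """Extract the ISBN-13, title, and contributors from one <Product>."""
    isbn = None
    for pid in product.findall("onix:ProductIdentifier", NS):
        # ProductIDType 15 is ISBN-13 in the ONIX code lists.
        if pid.findtext("onix:ProductIDType", namespaces=NS) == "15":
            isbn = pid.findtext("onix:IDValue", namespaces=NS)
    title = product.findtext(
        "onix:DescriptiveDetail/onix:TitleDetail/onix:TitleElement/onix:TitleText",
        namespaces=NS,
    )
    contributors = [
        (c.findtext("onix:ContributorRole", namespaces=NS),
         c.findtext("onix:PersonName", namespaces=NS))
        for c in product.findall("onix:DescriptiveDetail/onix:Contributor", NS)
    ]
    return {
        "isbn": isbn,
        "isbn_valid": isbn is not None and isbn13_is_valid(isbn),
        "title": title,
        "contributors": contributors,
    }


root = ET.fromstring(SAMPLE)
summary = summarize(root.find("onix:Product", NS))
print(summary["isbn"], summary["isbn_valid"], summary["title"])
# 9789999999991 True This Is My Distinctive Title
```

Contributor roles stay as raw ONIX codes here (`A01` is "By (author)"). Mapping them to readable labels would need the ONIX code lists.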
 
MARCXML Example  
<record xmlns="http://www.loc.gov/MARC21/slim">
<leader>00000cam a2200000Mi 4500</leader>
<controlfield tag="001">879398102</controlfield>
<controlfield tag="008">140327r20141952enka   j      000 f eng d</controlfield>
<datafield tag="015" ind1=" " ind2=" ">
<subfield code="a">GBB446685</subfield>
<subfield code="2">bnb</subfield>
</datafield>
<datafield tag="015" ind1=" " ind2=" ">
<subfield code="a">GBB338719</subfield>
<subfield code="2">bnb</subfield>
</datafield>
<datafield tag="016" ind1="7" ind2=" ">
<subfield code="a">016707475</subfield>
<subfield code="2">Uk</subfield>
</datafield>
<datafield tag="016" ind1="7" ind2=" ">
<subfield code="a">016320739</subfield>
<subfield code="2">Uk</subfield>
</datafield>
<datafield tag="019" ind1=" " ind2=" ">
<subfield code="a">840431503</subfield>
<subfield code="a">862656884</subfield>
<subfield code="a">965993263</subfield>
<subfield code="a">986553405</subfield>
<subfield code="a">986675812</subfield>
<subfield code="a">988646968</subfield>
<subfield code="a">989046668</subfield>
</datafield>
<datafield tag="020" ind1=" " ind2=" ">
<subfield code="a">9780141354828</subfield>
<subfield code="q">(paperback)</subfield>
</datafield>
<datafield tag="020" ind1=" " ind2=" ">
<subfield code="a">0141354828</subfield>
<subfield code="q">(paperback)</subfield>
</datafield>
<datafield tag="020" ind1=" " ind2=" ">
<subfield code="a">9780141331331</subfield>
<subfield code="q">(paperback)</subfield>
</datafield>
<datafield tag="020" ind1=" " ind2=" ">
<subfield code="a">014133133X</subfield>
<subfield code="q">(paperback)</subfield>
</datafield>
<datafield tag="020" ind1=" " ind2=" ">
<subfield code="a">0141317345</subfield>
</datafield>
<datafield tag="020" ind1=" " ind2=" ">
<subfield code="a">9780141317342</subfield>
</datafield>
<datafield tag="050" ind1=" " ind2="4">
<subfield code="a">PZ7.W58277</subfield>
<subfield code="b">.C47 2013</subfield>
</datafield>
<datafield tag="082" ind1="0" ind2="4">
<subfield code="a">813.52</subfield>
<subfield code="2">23</subfield>
</datafield>
<datafield tag="084" ind1=" " ind2=" ">
<subfield code="a">I712.45</subfield>
<subfield code="2">clc</subfield>
</datafield>
<datafield tag="100" ind1="1" ind2=" ">
<subfield code="a">White, E. B.</subfield>
<subfield code="q">(Elwyn Brooks),</subfield>
<subfield code="d">1899-1985,</subfield>
<subfield code="e">author.</subfield>
</datafield>
<datafield tag="245" ind1="1" ind2="0">
<subfield code="a">Charlotte's web /</subfield>
<subfield code="c">E.B. White ; illustrated by Garth
Williams.</subfield>
</datafield>
<datafield tag="264" ind1=" " ind2="1">
<subfield code="a">London :</subfield>
<subfield code="b">Puffin,</subfield>
<subfield code="c">2014.</subfield>
</datafield>
<datafield tag="300" ind1=" " ind2=" ">
<subfield code="a">184 pages :</subfield>
<subfield code="b">color illustrations ;</subfield>
<subfield code="c">27 cm</subfield>
</datafield>
<datafield tag="336" ind1=" " ind2=" ">
<subfield code="a">text</subfield>
<subfield code="b">txt</subfield>
<subfield code="2">rdacontent</subfield>
</datafield>
<datafield tag="336" ind1=" " ind2=" ">
<subfield code="a">still image</subfield>
<subfield code="b">sti</subfield>
<subfield code="2">rdacontent</subfield>
</datafield>
<datafield tag="337" ind1=" " ind2=" ">
<subfield code="a">unmediated</subfield>
<subfield code="b">n</subfield>
<subfield code="2">rdamedia</subfield>
</datafield>
<datafield tag="338" ind1=" " ind2=" ">
<subfield code="a">volume</subfield>
<subfield code="b">nc</subfield>
<subfield code="2">rdacarrier</subfield>
</datafield>
<datafield tag="490" ind1="0" ind2=" ">
<subfield code="a">Puffin modern classics</subfield>
</datafield>
<datafield tag="500" ind1=" " ind2=" ">
<subfield code="a">Originally published: London: Hamish Hamilton,
1952.</subfield>
</datafield>
<datafield tag="520" ind1="8" ind2=" ">
<subfield code="a">One spring morning a little girl called Fern
rescues a runt and names him Wilbur. But then Wilbur is sent to live
on a farm where he meets Charlotte, a beautiful large grey spider.
They become best friends and, when Wilbur is faced with a dreadful
fate, Charlotte must find a clever way to save him.</subfield>
</datafield>
<datafield tag="505" ind1="0" ind2=" ">
<subfield code="a">Machine generated contents note: i. Before
Breakfast -- ii. Wilbur -- iii. Escape -- iv. Loneliness -- v.
Charlotte -- vi. Summer Days -- vii. Bad News -- viii. A Talk at Home
-- ix. Wilbur's Boast -- x. An Explosion -- xi. The Miracle -- xii. A
Meeting -- xiii. Good Progress -- xiv. Dr. Dorian -- xv. The Crickets
-- xvi. Off to the Fair -- xvii. Uncle -- xviii. The Cool of the
Evening -- xix. The Egg Sac -- xx. The Hour of Triumph -- xxi. Last
Day.</subfield>
</datafield>
<datafield tag="650" ind1=" " ind2="0">
<subfield code="a">Swine</subfield>
<subfield code="v">Juvenile fiction.</subfield>
</datafield>
<datafield tag="650" ind1=" " ind2="0">
<subfield code="a">Spiders</subfield>
<subfield code="v">Juvenile fiction.</subfield>
</datafield>
<datafield tag="650" ind1=" " ind2="7">
<subfield code="a">Children's stories.</subfield>
<subfield code="2">cct</subfield>
</datafield>
<datafield tag="650" ind1=" " ind2="7">
<subfield code="a">Spiders</subfield>
<subfield code="v">Juvenile fiction.</subfield>
<subfield code="2">cct</subfield>
</datafield>
<datafield tag="650" ind1=" " ind2="7">
<subfield code="a">Swine</subfield>
<subfield code="v">Juvenile fiction.</subfield>
<subfield code="2">cct</subfield>
</datafield>
<datafield tag="650" ind1=" " ind2="7">
<subfield code="a">Spiders.</subfield>
<subfield code="2">fast</subfield>
<subfield code="0">(OCoLC)fst01129749</subfield>
</datafield>
<datafield tag="650" ind1=" " ind2="7">
<subfield code="a">Swine.</subfield>
<subfield code="2">fast</subfield>
<subfield code="0">(OCoLC)fst01140419</subfield>
</datafield>
<datafield tag="655" ind1=" " ind2="0">
<subfield code="a">Children's stories.</subfield>
</datafield>
<datafield tag="655" ind1=" " ind2="7">
<subfield code="a">Fiction.</subfield>
<subfield code="2">fast</subfield>
<subfield code="0">(OCoLC)fst01423787</subfield>
</datafield>
<datafield tag="655" ind1=" " ind2="7">
<subfield code="a">Juvenile works.</subfield>
<subfield code="2">fast</subfield>
<subfield code="0">(OCoLC)fst01411637</subfield>
</datafield>
<datafield tag="700" ind1="1" ind2=" ">
<subfield code="a">Williams, Garth,</subfield>
<subfield code="e">illustrator.</subfield>
</datafield>
<datafield tag="952" ind1="1" ind2=" ">
<subfield code="a">83</subfield>
</datafield>
<datafield tag="856" ind1="1" ind2=" ">
<subfield code="u">http://www.worldcat.org/oclc/879398102</subfield>
<subfield code="z">Find in a Library</subfield>
</datafield>
</record>
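
The same required fields can be read out of a MARCXML record by tag and subfield code: ISBNs in 020 $a, the title in 245 $a, and contributors with their roles in 100 and 700 ($a and $e). The sketch below runs on a trimmed copy of the record above using Python's standard library. The helper names `subfields` and `summarize` and the punctuation-stripping rules are assumptions for illustration.

```python
import xml.etree.ElementTree as ET

NS = {"marc": "http://www.loc.gov/MARC21/slim"}

# Trimmed copy of the MARCXML example, kept inline so the sketch is self-contained.
SAMPLE = """<record xmlns="http://www.loc.gov/MARC21/slim">
  <datafield tag="020" ind1=" " ind2=" ">
    <subfield code="a">9780141354828</subfield>
    <subfield code="q">(paperback)</subfield>
  </datafield>
  <datafield tag="020" ind1=" " ind2=" ">
    <subfield code="a">0141354828</subfield>
  </datafield>
  <datafield tag="100" ind1="1" ind2=" ">
    <subfield code="a">White, E. B.</subfield>
    <subfield code="e">author.</subfield>
  </datafield>
  <datafield tag="245" ind1="1" ind2="0">
    <subfield code="a">Charlotte's web /</subfield>
  </datafield>
  <datafield tag="700" ind1="1" ind2=" ">
    <subfield code="a">Williams, Garth,</subfield>
    <subfield code="e">illustrator.</subfield>
  </datafield>
</record>"""


def subfields(record: ET.Element, tag: str, code: str) -> list[str]:
    """Return the text of every subfield `code` in every datafield `tag`."""
    path = f"marc:datafield[@tag='{tag}']/marc:subfield[@code='{code}']"
    return [el.text or "" for el in record.findall(path, NS)]


def summarize(record: ET.Element) -> dict:
    """Extract ISBNs, title, and (name, role) pairs from one record."""
    contributors = []
    for datafield in record.findall("marc:datafield", NS):
        if datafield.get("tag") in ("100", "700"):
            name = datafield.findtext("marc:subfield[@code='a']",
                                      default="", namespaces=NS)
            role = datafield.findtext("marc:subfield[@code='e']",
                                      default="", namespaces=NS)
            # MARC punctuation (trailing commas and periods) is display
            # formatting, so it is stripped from the extracted values.
            contributors.append((name.rstrip(" ,"), role.rstrip(" .,")))
    titles = subfields(record, "245", "a")
    return {
        "isbns": subfields(record, "020", "a"),
        "title": titles[0].rstrip(" /") if titles else None,
        "contributors": contributors,
    }


summary = summarize(ET.fromstring(SAMPLE))
print(summary["title"], summary["isbns"])
# Charlotte's web ['9780141354828', '0141354828']
```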
