- Query the DDB API for Newspapers
- returns a Pandas Dataframe Object
Usage:
from ddbapi import zp_issues, zp_pages, list_column, filter
df = zp_issues(publication_date='[1600-09-01T12:00:00Z TO 1699-12-31T12:00:00Z]')
print(df)
- Returns a DataFrame containing Data on Newspaper-Issues.
- Use any combination of these keyword arguments:
language
: Use ISO Codes, currentlyger
,eng
,fre
,spa
,ita
place_of_distribution
: Search inside "Verbreitungsort", use a list for multiple search-wordspublication_date
: Get newspapers by publication date. Use the following format:1900-12-31T12:00:00Z
for a specific date, use square brackets andTO
between two dates to get a daterange like so:publication_date='[1935-09-01T12:00:00Z TO 1935-09-22T12:00:00Z]'
- time is always12:00:00Z
.zdb_id
: Search by ZDB-IDprovider
: Search by Data Providerpaper_title
: Search inside the title of the Newspaper
- Returns a DataFrame containing Data on Newspaper-Pages.
- Use any combination of these keyword arguments:
plainpagefulltext
: Search inside the OCR Fulltext (Use a list for multiple search-words)language
: Use ISO Codes, currentlyger
,eng
,fre
,spa
,ita
place_of_distribution
: Search inside "Verbreitungsort", use a list for multiple search-wordspublication_date
: Get newspapers by publication date. Use the following format:1900-12-31T12:00:00Z
for a specific date, use square brackets andTO
between two dates to get a daterange like so:publication_date='[1935-09-01T12:00:00Z TO 1935-09-22T12:00:00Z]'
- time is always12:00:00Z
.zdb_id
: Search by ZDB-IDprovider
: Search by Data Providerpaper_title
: Search inside the title of the Newspaper
- Values of keyword arguments may contain lists to combine queries.
- Use
list_column
andfilter
to perform usual Pandas Operations on list-containing Columns (eg.list_column(df['place_of_distribution']).value_counts()
orfilter('Altona', 'place_of_distribution', df)
)
See this Notebook for a usage example.