aleph.metadata

Metadata

class aleph.metadata.Metadata

Imports

  • object

Summary

Handle all sorts of metadata normalization for documents.

from_data(cls, data)

Instantiate a Metadata object based on a given dict.

facets(cls)
__init__(self)
__repr__(self)
add_country(self, country)
add_date(self, obj)
add_domain(self, domain)
add_email(self, email)
add_keyword(self, kw)
add_language(self, language)
add_phone_number(self, number)
add_url(self, url)
clone(self)
content_hash(self, content_hash)
countries(self, countries)
dates(self, dates)
domains(self, domains)
emails(self, emails)
extension(self, extension)
file_name(self, file_name)
file_title(self)
foreign_id(self, foreign_id)
has(self, name)
headers(self, headers)
is_pdf(self)
keywords(self, keywords)
languages(self, languages)
make_child(self)
mime_type(self, mime_type)
parent(self, parent)
pdf(self)
phone_numbers(self, phone_numbers)
tables(self, tables)
title(self, title)
to_attr_dict(self)

Return the data for each attribute.

to_dict(self)

Generate REST API form.

to_index_dict(self)

Generate ElasticSearch form.

update(self, data)
urls(self, urls)

PDFAlternative

class aleph.metadata.PDFAlternative

Imports

  • object

Summary

Alternate PDF version.

__init__(self, meta)
content_hash(self, content_hash)