Parser

Starting in version 0.3.0, secedgar will start to implement a general-purpose parser.

This is expected to be an ambitious task and may not be feasible for all filing types. However, given user feedback, we feel that it is worth a shot. Please note that parsing will remain experimental for the foreseeable future.

Parsing Meta Data

class secedgar.parser.MetaParser

Utility class to extract metadata and documents from a single text file.

Warning

The MetaParser class is still experimental. Use with caution.

New in version 0.3.0.

process(infile, out_dir=None, create_subdir=True, rm_infile=False)

Process a text file and save processed files.

Parameters
  • infile (str) – Full path to a text file.

  • out_dir (str) – Directory to store output files. Defaults to the parent directory of infile.

  • create_subdir (bool) – If a subdirectory with the name of the infile should be created. If this is not true, files will be prefixed with the infile filename.

  • rm_infile (bool) – If the infile should be removed after processing. Defaults to False.

Returns

None

static process_document_metadata(doc)

Process the metadata of an embedded document.

Parameters

doc (str) – Document to extract meta data from.

Returns

Dictionary with fields parsed from document.

Return type

dict

static process_metadata(curr_doc)

Process the metadata of the focal document.

Parameters

curr_doc (str) – Process meta data for single focal document.

Returns

Meta data from focal document.

Return type

out_dict (dict)

Parsing Form 4 Data

class secedgar.parser.F4Parser

Utility class to extract actionable data and documents from a single text file.

Warning

The F4Parser class is still experimental. Use with caution.

New in version 0.4.0.

static process(doc)

Process the actionable data of the document.

Parameters

doc (str) – Document from which to extract core data.

Returns

Tradable buy/sell/gift data from document.

Return type

data (dict)