os-climate/DERA-ingest-pipeline

SEC DERA Ingestion Pipeline

The Division of Economic and Risk Analysis (DERA) was created in September 2009 to integrate financial economics and rigorous data analytics into the core mission of the SEC. The Division is involved across the entire range of SEC activities, including policy-making, rule-making, enforcement, and examination.

Data is central to DERA's mission. The SEC requires all companies that trade on US stock exchanges to make certain data available, and DERA collects and publishes that data. A summary form of that data is listed as Financial Statements, and a more detailed version is listed as Financial Statements and Notes.

The principal notebook for implementing this pipeline is DERA-ingest. It performs a basic ingestion of the SEC data, marrying company names to Global Legal Entity Identifiers where it can. There are hundreds of millions of rows of data for just the past few years of annual reports. We could increase that considerably by ingesting quarterly reports as well, but so far there has been no need for that.
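To illustrate the name-matching step, here is a minimal sketch of attaching identifiers to company names with pandas. It is not the code from the DERA-ingest notebook: the `normalize_name` and `attach_lei` helpers, the `legal_name` and `lei` column names, and the placeholder identifier values are all assumptions made for this example. Only exact matches on a normalized name are attempted, so companies without a match keep a missing identifier.

```python
import re

import pandas as pd


def normalize_name(name: str) -> str:
    """Upper-case a company name, replace punctuation with spaces, collapse whitespace."""
    cleaned = re.sub(r"[^A-Z0-9 ]", " ", name.upper())
    return " ".join(cleaned.split())


def attach_lei(sub: pd.DataFrame, lei: pd.DataFrame) -> pd.DataFrame:
    """Left-join identifiers onto filers by normalized name (hypothetical helper).

    `sub` needs a `name` column; `lei` needs `legal_name` and `lei` columns.
    These column names are assumptions for the sketch.
    """
    sub = sub.assign(name_key=sub["name"].map(normalize_name))
    lei = lei.assign(name_key=lei["legal_name"].map(normalize_name))
    # Keep one identifier per normalized name so the join cannot multiply rows.
    lei = lei.drop_duplicates("name_key")[["name_key", "lei"]]
    return sub.merge(lei, on="name_key", how="left").drop(columns="name_key")


# Toy data; the identifier strings are placeholders, not real LEIs.
sub = pd.DataFrame(
    {"cik": [1, 2, 3], "name": ["ACME CORP", "Widget Co., Inc.", "NoMatch LLC"]}
)
lei = pd.DataFrame(
    {
        "legal_name": ["Acme Corp.", "WIDGET CO INC"],
        "lei": ["PLACEHOLDER_ACME", "PLACEHOLDER_WIDGET"],
    }
)

matched = attach_lei(sub, lei)
print(matched)
```

A left join keeps every filer in the result, which mirrors the "where it can" behaviour described above: unmatched companies stay in the data with an empty identifier rather than being dropped.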

Once the basic financial data has been ingested, a second notebook (SEC Corp Financials) makes the data "easy to use" by summarizing market float (aka market cap), annual revenues, income, and reported cash, debt, and assets at the time of the annual report. We also apply some crosswalks to make it easier to connect the SIC-based SEC data with the ISIC codes used internationally.
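The summarizing and crosswalk steps can be sketched roughly as follows. This is an illustration under stated assumptions, not the SEC Corp Financials notebook itself: the long `facts` table, its column names, the choice of tags, and the two-row `sic_to_isic` table are all made up for the example, and the SIC-to-ISIC pairs are illustrative rather than taken from an official crosswalk.

```python
import pandas as pd

# Long-format facts, one row per (company, fiscal year, tag). Assumed layout.
facts = pd.DataFrame(
    {
        "cik": [1, 1, 1, 2, 2],
        "fy": [2020, 2020, 2020, 2020, 2020],
        "tag": ["Revenues", "Assets", "EntityPublicFloat", "Revenues", "Assets"],
        "value": [100.0, 500.0, 900.0, 40.0, 80.0],
    }
)

# One row per company with its SIC code (kept as strings to preserve leading zeros).
companies = pd.DataFrame({"cik": [1, 2], "sic": ["1311", "4911"]})

# Illustrative crosswalk rows only; a real crosswalk would be loaded from a file.
sic_to_isic = pd.DataFrame({"sic": ["1311", "4911"], "isic": ["0610", "3510"]})

# Pivot to one row per company and fiscal year, one column per tag.
summary = facts.pivot_table(
    index=["cik", "fy"], columns="tag", values="value", aggfunc="first"
).reset_index()
summary.columns.name = None

# Attach the SIC code, then translate it to ISIC through the crosswalk.
summary = summary.merge(companies, on="cik", how="left").merge(
    sic_to_isic, on="sic", how="left"
)
print(summary)
```

Tags that a company did not report, such as the public float for the second company here, show up as missing values in the wide table instead of removing the company from the summary.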

In the future, this pipeline will use the ESG Matching services of the Data Commons. We are also looking at ingesting the more detailed DERA information so that analyses can be run on business segments, not only on whole consolidated reporting entities.

If you have questions, please file Issues. If you have answers, please contribute Pull Requests!


Project based on the cookiecutter data science project template.
