Questions tagged [edgar]

EDGAR is the U.S. Securities and Exchange Commission's information system for company filing data. Use this tag for questions about parsing and querying that data and its public APIs.

EDGAR stands for Electronic Data Gathering, Analysis, and Retrieval. The system uses several data formats: the classic SGML-based format, the XML-based XBRL format for business reporting, and many more.
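
As a rough illustration of the kind of parsing this tag covers, here is a minimal sketch that reads the pipe-delimited rows of an EDGAR master index (`.idx`) file. It assumes the usual `CIK|Company Name|Form Type|Date Filed|File Name` column layout. The sample rows, `IndexEntry`, and `parse_master_index` are made up for this example and do not come from any library.

```python
from typing import List, NamedTuple


class IndexEntry(NamedTuple):
    """One row of a master index file (hypothetical helper type)."""
    cik: str
    company: str
    form_type: str
    date_filed: str
    filename: str


def parse_master_index(text: str) -> List[IndexEntry]:
    """Return the data rows of a master index, skipping header lines.

    Assumes data rows have exactly five pipe-separated fields and
    start with a numeric CIK; everything else is treated as header.
    """
    entries = []
    for line in text.splitlines():
        parts = line.strip().split("|")
        if len(parts) != 5 or not parts[0].isdigit():
            continue  # description lines, column header, dashed separator
        entries.append(IndexEntry(*parts))
    return entries


# Made-up sample in the shape of a master index file (not real filings).
SAMPLE = """\
Description:           Master Index of EDGAR Dissemination Feed
CIK|Company Name|Form Type|Date Filed|File Name
--------------------------------------------------------------------------------
1234567|Example Corp|10-K|20201231|edgar/data/1234567/0001234567-20-000001.txt
7654321|Sample Holdings Inc|8-K|20201231|edgar/data/7654321/0007654321-20-000002.txt
"""

if __name__ == "__main__":
    for entry in parse_master_index(SAMPLE):
        if entry.form_type == "10-K":
            print(entry.cik, entry.company, entry.filename)
```

Parsing from a string keeps the sketch independent of how the file is fetched. When downloading from sec.gov, note that the SEC expects automated requests to declare a descriptive User-Agent header.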

120 questions
2
votes
2 answers

Extracting text section from (Edgar 10-K filings) HTML

I am trying to extract a certain section from HTML files. To be specific, I am looking for the "ITEM 1" section of 10-K filings (US business reports for a given company).…
Heka
  • 73
  • 1
  • 8
2
votes
1 answer

Extraction of text using Beautiful Soup and regular expressions in 10-K Edgar filings

I want to automatically extract section "1A. Risk Factors" from around 10,000 files and write it into txt files. A sample URL with a file can be found here. The desired section is between "Item 1a Risk Factors" and "Item 1b". The thing is that the…
2
votes
0 answers

Download 10-k for each corporation, group by SIC code

I am using the EDGAR library in R to download all 2005 10-Ks. The code below will prompt "yes" to download the 10-K for each CIK in my loop. (h/t to Weihuang Wong for assisting me with this.) install.packages("edgar") library(edgar) report <-…
user7317101
2
votes
1 answer

dataset for 13F/13G SEC filing

I am looking for a database that contains the 13F/13G filings in Quandl but can't find any. Maybe I am not using the right keywords? Any suggestions on where to find a curated dataset? I don't want to end up scraping EDGAR again. Cheers!
user2471214
  • 729
  • 9
  • 17
1
vote
0 answers

How to get 10-K filings for the Russell 3000 constituents using sec-api?

Is it possible to fetch the 10-K filings using something like the following? from sec_api import QueryApi query_api = QueryApi(api_key='YOUR_API_KEY') # Define search parameters index_ticker = 'RUA' # Russell 3000 index ticker symbol filing_type…
Ayodele Ashad
  • 87
  • 1
  • 9
1
vote
1 answer

Accessing filing data from SEC EDGAR API in Python

I want to pull reports from the SEC EDGAR API and conduct analysis within Python. From what I can tell, the main issue is that I'm using the wrong file format, but the methods I have found to convert to HTML did not work. I have limited…
SPR26
  • 11
  • 3
1
vote
3 answers

Inconsistent tags between XBRL files from the SEC (EDGAR)

I'm parsing every XBRL file from the SEC through EDGAR in order to retrieve some data (in JSON format, using Python). I have no problem parsing those files. My problem lies in the structure of the XBRL files provided by the SEC. I noticed that some…
Omar Sow
  • 15
  • 3
1
vote
3 answers

Syncing one company's financial year to another company's using the EDGAR database

I'm working on a school project that was built by a previous group, and one of my tasks is to synchronize the fiscal year's start date according to the start date of the company of focus. This is the code used to grab the JSON from EDGAR using their…
MG0310
  • 11
  • 1
1
vote
2 answers

Can ExtractorApi in sec-api module be used for 10-Q filings?

I am trying to extract specific sections from the 10-Q report using ExtractorApi from the sec-api module. The module works for 10-K; however, it fails with certain sections for the 10-Q. For example, if I want to extract item 3 from 10-Q, the following…
1
vote
1 answer

Build financial table from EDGAR XBRL files

When companies submit their reports to the SEC, a number of files are made available, e.g. the latest 10-K of AAPL. From these, even the SEC website (and many others) presents the tables as structured data. What I would like to do is to reproduce it myself,…
jollytall
  • 117
  • 8
1
vote
1 answer

How should I scrape an idx file on EDGAR?

I have an idx file: https://www.sec.gov/Archives/edgar/daily-index/2020/QTR4/master.20201231.idx I could open the idx file with the following code a year ago, but the code doesn't work now. Why is that? How should I modify the code? import…
Julie
  • 57
  • 4
1
vote
1 answer

Python Edgar asks for useragent

I ran the following command: import edgar import pandas as pd edgar.download_index('/Users/myusername/Desktop/Desktop', 2010,skip_all_present_except_last=False) It's throwing the following error asking for user_agent. Please clarify where I can get…
Thayyib PV
  • 11
  • 2
1
vote
0 answers

Download multiple 10-K documents

I need to download multiple 10-K documents. This code works fine if I download the 10-Ks for 5-10 companies, but it fails if I increase the number of companies in [cik_lookup function]. Here's the code. import nltk import numpy as np import pandas…
Researcher
  • 149
  • 1
  • 8
1
vote
0 answers

Issue with visualising data scraped with edgar library in python

I am currently trying to use the edgar library in Python to download S-1 filings from the SEC EDGAR database. I tried running the example code that is given in the library project description, which is as follows: company = edgar.Company("Oracle…
pbosani
  • 71
  • 3
1
vote
0 answers

Do "Consolidated Financial Data" and "Form 8-K" and similar legal terms count as named entities (for Named Entity Recognition)?

Thanks for helping. I am doing some named entity tagging and came across a few ambiguous terms. I follow mostly the CoNLL2003 annotation guidelines and MUC-7 named entity definition (other annotation guidelines mostly share the same idea with these…
Terence Au
  • 11
  • 2