Questions tagged [edgar]

EDGAR is an information system of the U.S. Securities and Exchange Commission holding company data. Questions related to parsing and querying the data and public APIs should be tagged.

EDGAR stays for Electronic Data Gathering, Analysis, and Retrieval. This information system uses several data formats: classic SGML based, XML-based XBRL format for business reporting and many more.

120 questions
0
votes
0 answers

Download .txt from URL string in R

I am working with the EDGAR package in R to download Apple's 2005 Annual Report. This is my code to get that far: library(edgar) getMasterIndex(2005) aapl<-getFilings(2005,320193, '10-K') This was my output when I did that: >…
user7317101
0
votes
2 answers

String Matching in Python?

I'm having trouble matching strings in Python. What I'm trying to do is look for lines in documents like this and try to match each line to specific phrases. I'm reading in all the lines and parsing with Beautfiul soup into stripped strings, then…
Retroflux
  • 57
  • 1
  • 1
  • 9
0
votes
1 answer

Generate EDGAR FTP File Path List

I'm brand new to programming (though I'm willing to learn), so apologies in advance for my very basic question. The [SEC makes available all of their filings via FTP][1], and eventually, I would like to download a subset of these files in bulk.…
user53526356
  • 934
  • 1
  • 11
  • 25
0
votes
0 answers

How to create tabular output in python

Currently, I'm looking to scrape the signatures table from the edgar filings for specific companies. I have created a Python program to get down into each document and finds the tables that I need to scrape. I'm having trouble figuring out how to…
Retroflux
  • 57
  • 1
  • 1
  • 9
0
votes
1 answer

Generating a Fact Table from a presentation linkbase

This is a follow up question from this SO question. Having now created the Model Structure from the presentation linkbase found here, I now would like to extract the Fact Table information. The model structure…
pdoak
  • 721
  • 8
  • 21
0
votes
1 answer

FTP 550 Error: No such file or directory in EGDAR FTP server

I am trying to download all the master.idx files of different quarters in different years. The EDGAR FTP is structured like this: "edgar/full-index/2011/QTR3/", and in the QTR3 folder is the master.idx file. I tried to access the folder of…
0
votes
1 answer

Edgar (finstr): error: cannot join on columns 'elementId' x 'elementId' - R

Ran following: library(finstr) inst<- "http://edgar.sec.gov/Archives/edgar/data/320193/000119312514383437/aapl-20140927.xml" xbrl.vars<- xbrlDoAll(inst, verbose=TRUE) st2013<-xbrl_get_statements(xbrl.vars) Recevied following error Error: cannot…
lpt
  • 931
  • 16
  • 35
-1
votes
1 answer

Web scraping for multiple classes using python

I am trying to scrape address from 10K filing document in HTML: https://www.sec.gov/Archives/edgar/data/1652044/000165204419000032/goog10-qq32019.htm It has multiple div class, and I want to scrape for address inside span. Expected output: 1600…
-1
votes
2 answers

What does the `end` property mean in the XBRL data?

I'm trying to read some programmatic API data from EDGAR, and I'm having trouble understanding the end property. For example, when looking at AssetsCurrent: "end": "2018-03-31", "val": 206935000000, "accn": "0001652044-18-000016", "fy": 2018, "fp":…
jayjyli
  • 771
  • 3
  • 11
  • 23
-1
votes
2 answers

Download a txt file from EDGAR

I want to download this file to my local drive: https://www.sec.gov/Archives/edgar/data/1556179/0001104659-20-000861.txt Here are my codes: import requests import urllib from bs4 import BeautifulSoup import re …
Julie
  • 57
  • 4
-1
votes
1 answer

BeautifulSoup: Get the HTML Code of Modal Footer

I'm new to Web scraping in Python and try to scrape all htm document-links from an SEC Edgar full-text search. I can see the link in the Modal Footer, but BeautifulSoup won't parse the href Element with the link. Is there an easy solution to parse…
Steve
  • 1
-1
votes
1 answer

Scraping EDGAR with Python codes (Program 2) not working

I tried the python codes from the article of Rasha Ashraf "Scraping EDGAR with Python". Yesterday I got helped from you great developer(s). Specially Thanks for Jack Fleeting. The links related to this problem are as follows: Text Scraping (from…
-1
votes
2 answers

how to use readlines()

Here on running the following code I am getting an error as below. Can anyone please help me with this. import urllib import requests url = "https://www.sec.gov/Archives/edgar/data/1800/000104746918000856/0001047469-18-000856.txt" page =…
Berlin
  • 7
  • 4
-1
votes
1 answer

SAS Regex code to capture Business Address from 10-K company filings

Consider the following EDGAR 10-K SEC Company Filing https://www.sec.gov/Archives/edgar/data/912382/000136231009004179/0001362310-09-004179.txt BUSINESS ADDRESS: STREET 1: 107 N PENNSYLVANIA ST STREET 2: STE 600 CITY: …
-2
votes
2 answers

Efficient downloading of 10-K filings from SEC website

I use the following perl code to mass download 10-Ks from the SEC website. However, I get an "Out of memory!" message every few hundred files when the script apparently gets stuck processing an especially large 10-K filing. Any ideas how I can…
Rick
  • 107
  • 2
  • 12
1 2 3 4 5 6 7
8