Questions tagged [edgar]

EDGAR is an information system of the U.S. Securities and Exchange Commission holding company data. Questions related to parsing and querying the data and public APIs should be tagged.

EDGAR stays for Electronic Data Gathering, Analysis, and Retrieval. This information system uses several data formats: classic SGML based, XML-based XBRL format for business reporting and many more.

120 questions

votes

0 answers

Download .txt from URL string in R

I am working with the EDGAR package in R to download Apple's 2005 Annual Report. This is my code to get that far: library(edgar) getMasterIndex(2005) aapl<-getFilings(2005,320193, '10-K') This was my output when I did that: >…

asked Feb 03 '17 at 16:31

user7317101

votes

2 answers

String Matching in Python?

I'm having trouble matching strings in Python. What I'm trying to do is look for lines in documents like this and try to match each line to specific phrases. I'm reading in all the lines and parsing with Beautfiul soup into stripped strings, then…

python string python-3.x string-matching edgar

asked Jun 03 '16 at 14:39

Retroflux

votes

1 answer

Generate EDGAR FTP File Path List

I'm brand new to programming (though I'm willing to learn), so apologies in advance for my very basic question. The [SEC makes available all of their filings via FTP][1], and eventually, I would like to download a subset of these files in bulk.…

ftp edgar

asked May 22 '16 at 16:39

user53526356

votes

0 answers

How to create tabular output in python

Currently, I'm looking to scrape the signatures table from the edgar filings for specific companies. I have created a Python program to get down into each document and finds the tables that I need to scrape. I'm having trouble figuring out how to…

python-3.x beautifulsoup edgar

asked May 13 '16 at 15:36

Retroflux

votes

1 answer

Generating a Fact Table from a presentation linkbase

This is a follow up question from this SO question. Having now created the Model Structure from the presentation linkbase found here, I now would like to extract the Fact Table information. The model structure…

xml xbrl edgar

asked Mar 31 '16 at 14:05

pdoak

votes

1 answer

FTP 550 Error: No such file or directory in EGDAR FTP server

I am trying to download all the master.idx files of different quarters in different years. The EDGAR FTP is structured like this: "edgar/full-index/2011/QTR3/", and in the QTR3 folder is the master.idx file. I tried to access the folder of…

python ftp web-scraping edgar

asked Oct 07 '15 at 16:37

Shijie Wang

votes

1 answer

Edgar (finstr): error: cannot join on columns 'elementId' x 'elementId' - R

Ran following: library(finstr) inst<- "http://edgar.sec.gov/Archives/edgar/data/320193/000119312514383437/aapl-20140927.xml" xbrl.vars<- xbrlDoAll(inst, verbose=TRUE) st2013<-xbrl_get_statements(xbrl.vars) Recevied following error Error: cannot…

r xbrl edgar

asked Jul 15 '15 at 19:05

lpt

-1

votes

1 answer

Web scraping for multiple classes using python

I am trying to scrape address from 10K filing document in HTML: https://www.sec.gov/Archives/edgar/data/1652044/000165204419000032/goog10-qq32019.htm It has multiple div class, and I want to scrape for address inside span. Expected output: 1600…

python web-scraping web beautifulsoup edgar

asked Feb 03 '23 at 09:35

Sushmitha Krishnan

-1

votes

2 answers

What does the `end` property mean in the XBRL data?

I'm trying to read some programmatic API data from EDGAR, and I'm having trouble understanding the end property. For example, when looking at AssetsCurrent: "end": "2018-03-31", "val": 206935000000, "accn": "0001652044-18-000016", "fy": 2018, "fp":…

json edgar sec

asked Jun 26 '22 at 08:36

jayjyli

-1

votes

2 answers

Download a txt file from EDGAR

I want to download this file to my local drive: https://www.sec.gov/Archives/edgar/data/1556179/0001104659-20-000861.txt Here are my codes: import requests import urllib from bs4 import BeautifulSoup import re …

python download edgar

asked Jan 16 '22 at 14:22

Julie

-1

votes

1 answer

BeautifulSoup: Get the HTML Code of Modal Footer

I'm new to Web scraping in Python and try to scrape all htm document-links from an SEC Edgar full-text search. I can see the link in the Modal Footer, but BeautifulSoup won't parse the href Element with the link. Is there an easy solution to parse…

python html-parsing edgar

asked Sep 30 '21 at 16:48

Steve

-1

votes

1 answer

Scraping EDGAR with Python codes (Program 2) not working

I tried the python codes from the article of Rasha Ashraf "Scraping EDGAR with Python". Yesterday I got helped from you great developer(s). Specially Thanks for Jack Fleeting. The links related to this problem are as follows: Text Scraping (from…

python-3.x url scrape edgar sec

asked Nov 13 '20 at 21:53

Jason SJ Yim

-1

votes

2 answers

how to use readlines()

Here on running the following code I am getting an error as below. Can anyone please help me with this. import urllib import requests url = "https://www.sec.gov/Archives/edgar/data/1800/000104746918000856/0001047469-18-000856.txt" page =…

python python-3.x readlines edgar

asked May 02 '20 at 18:36

Berlin

-1

votes

1 answer

SAS Regex code to capture Business Address from 10-K company filings

Consider the following EDGAR 10-K SEC Company Filing https://www.sec.gov/Archives/edgar/data/912382/000136231009004179/0001362310-09-004179.txt BUSINESS ADDRESS: STREET 1: 107 N PENNSYLVANIA ST STREET 2: STE 600 CITY: …

regex sas finance edgar

asked May 06 '17 at 16:02

Martin D.

-2

votes

2 answers

Efficient downloading of 10-K filings from SEC website

I use the following perl code to mass download 10-Ks from the SEC website. However, I get an "Out of memory!" message every few hundred files when the script apparently gets stuck processing an especially large 10-K filing. Any ideas how I can…

perl edgar

asked Apr 21 '17 at 20:08

Rick

Prev 1 2 3 4 5 6 7