Questions tagged [orientdb-etl]

The Extractor Transformer and Loader, or ETL, module for OrientDB provides support for moving data to and from OrientDB databases using ETL processes.

The Extractor Transformer and Loader, or ETL, module for OrientDB provides support for moving data to and from OrientDB databases using ETL processes.

Configuration: The ETL module uses a configuration file, written in JSON. Extractor Pulls data from the source database. Transformers Convert the data in the pipeline from its source format to one accessible to the target database. Loader loads the data into the target database.

http://orientdb.com/docs/last/ETL-Introduction.html

39 questions
4
votes
1 answer

OrientDB ETL loading CSV with vertices in one file and edges in another

I have some data that is in 2 CSV files, one contains the vertices and the other file contains the edges are in the other file. I'm working out how to set this up using ETL and am close but not quite there yet--it mostly works but my edges have…
TxAG98
  • 1,070
  • 2
  • 10
  • 25
4
votes
1 answer

import edges to OrientDB using etl

I have 3 tables, one is for vertex A, one is for vertex B, and the third is for edges from B to A. How can I import this graph to OrientDB? For now, the tutorial just says how to import two csv files, one is for vertex A, and the other one is for…
Cauchy Schwarz
  • 747
  • 3
  • 10
  • 27
2
votes
1 answer

OrientDB data write from CSV files

I was researching OrientDB and it in a single host mode it did not show great performance results. I need to create the table in OrientDB by importing the records from CSV file. Is there any specific command to write single table CSV file into…
Cassie
  • 2,941
  • 8
  • 44
  • 92
2
votes
1 answer

Use ETL to load CSV data into OrientDB containing a SPATIAL index

I'm interested in loading some data into an OrientDB from some CSV files that contain spatial coordinates in WGS84 Lat/Long. I'm using OrientDB 2.2.8 and have the lucene spatial module added to my $ORIENTDB_HOME/lib directory. I'm loading my data…
TxAG98
  • 1,070
  • 2
  • 10
  • 25
2
votes
0 answers

Edge properties clobbering vertex properties in OrientDB from ETL

This is a follow-up to another question I had posted regarding importing a simple database into OrientDB using ETL that had both edge and vertex properties with dates on both. Here's my…
TxAG98
  • 1,070
  • 2
  • 10
  • 25
2
votes
1 answer

OrientDB ETL: how to skip a duplicate vertex but create the edge

I am creating a communication graph. Each message has a msgid and each person has a userid. I have already created the message vertices, now i want to create the user vertices and an edge connecting a message vertex to the user vertex. A user can…
Albert S
  • 2,552
  • 1
  • 22
  • 28
2
votes
1 answer

OrientDB import edges only using ETL tool

I already used the OETL to insert all my Vertex to the graph. Now I have a file that outlines the edges at the following way: node_1,rel_type,node_2 11000001,relation_A,10208879 11000001,relation_A,10198662 11000001,relation_B,10159927 …
Dor Cohen
  • 16,769
  • 23
  • 93
  • 161
1
vote
0 answers

orientdb oetl error / I want to have pokec data

To configure the current pokec db ./oetl.sh I'm trying. However, the pipeline exit keeps occurring. I don't know what the problem is. Help me.The dburl is also a code that was well written but is now erased. I've tried all the settings, but they…
김윤아
  • 11
  • 1
1
vote
1 answer

OrientDB ETL, create edge between two vertices which are already in Graph

I am trying to create an edge between two vertices which are already part of OreintDB. My edge data is in a MySQL table. Here is my oetl json. { "config": { "log": "info" }, "source": { "file": { "path":…
RP-
  • 5,827
  • 2
  • 27
  • 46
1
vote
1 answer

How to import Edges from CSV with ETL into OrientDB graph?

I'm trying to import edges from a CSV-file into OrientDB. The vertices are stored in a separate file and already imported via ETL into OrientDB. So my situation is similar to OrientDB import edges only using ETL tool and OrientDB ETL loading CSV…
1
vote
1 answer

OrientDB ETL create edge using multiple fields in match criteria

I have some data that I'm tracking that looks something like this: node.csv Label1,Label2 Alpha,A Alpha,B Alpha,C Bravo,A Bravo,B The pair Label1 and Label2 define a unique entry in this data set. I have another table that has some values in it…
TxAG98
  • 1,070
  • 2
  • 10
  • 25
1
vote
1 answer

Error while importing beers database using CSV

I have the latest community edition 2.2.17. While importing the beers database using csv I am getting error while importing the beers. (categories, styles etc. all got imported fine). The errors are like: OrientDB etl v.2.2.17 (build…
dbusern
  • 195
  • 3
  • 14
1
vote
0 answers

Orient ETL perfomance issue with importing edges to plocal on SSD

My goal is importing 25M edges in the graph which has about 50M vertices. Target time: The current speed of importing is ~150 edges/sec. Speed on remote connection was about 100 edges/sec. extracted 20,694,336 rows (171 rows/sec) - 20,694,336…
Eugene
  • 448
  • 4
  • 12
1
vote
0 answers

Performance tuning for loading Gigabytes of data in OrientDB

I currently used the ETL tool to insert a bunch of CSV data into OrientDB. The system configuration i used for trial purpose is EC2 M3 large ( 7.5 GiB of memory, 2 vCPUs, 32 GB of SSD-based local instance storage, 64-bit platform ). The data(masked)…
pjesudhas
  • 399
  • 4
  • 13
1
vote
1 answer

ETL loader throws FileNotFoundException: error loading json file

I'm attempting to get the ETL loader to work on OrientDB 2.2.5 on an OSX machine (10.11.6) but I keep getting a FileNotFoundException on the input JSON file... I'm going off the example that can be found in the tutorials for the post.csv and…
TxAG98
  • 1,070
  • 2
  • 10
  • 25
1
2 3