Questions tagged [apache-atlas]

Apache Atlas is a data governance and metadata framework for Hadoop. Use for questions about setting up Atlas, the REST APIs, bridges, or problems encountered using Atlas.

Data Governance and Metadata framework for Hadoop

Features

  • Data Classification

Import or define taxonomy business-oriented annotations for data. Define, annotate, and automate capture of relationships between data sets and underlying elements including source, target, and derivation processes. Export metadata to third-party systems.

  • Centralized Auditing

Capture security access information for every application, process, and interaction with data. Capture the operational information for execution, steps, and activities.

  • Search & Lineage (Browse)

Pre-defined navigation paths to explore the data classification and audit information. Text-based search features locate relevant data and audit events across the Data Lake quickly and accurately. Browse visualization of data set lineage, allowing users to drill down into operational, security, and provenance-related information.

  • Security & Policy Engine

Rationalize compliance policy at runtime based on data classification schemes, attributes, and roles. Advanced definition of policies (prohibitions) for preventing data derivation based on classification (i.e. re-identification). Column- and row-level masking based on cell values and attributes.

107 questions
0
votes
1 answer

How to setup Apache Atlas using embedded cassandra and Apache Solr

Step 1: Clone the repository. git clone https://github.com/apache/atlas Step 2: Generated tar file by executing below command mvn clean -DskipTests package -Pdist,embedded-cassandra-solr Step 3: Once the build is successful, extracted…
Hari Krishna
  • 3,658
  • 1
  • 36
  • 57
0
votes
1 answer

apache atlas - hook hive - how to build apache-atlas-${project.version}-hive-hook.gz?

I want to hook Hive to import metadata automatically into Atlas, which is installed standalone. The instructions on the Apache Atlas website say "untar apache-atlas-${project.version}-hive-hook.tar.gz" and I can't find this file. I've tried to build…
0
votes
0 answers

How to configure apache Flink with Apache atlas for lineage?

I am looking to integrate Apache Atlas with Apache Flink to capture job lineage. I found some references to it from Cloudera (CDP), where they use an Atlas-Flink hook, but I am not able to find any documentation or implementation without CDP and…
Sanjay Bagal
  • 99
  • 1
  • 8
0
votes
0 answers

Unable to execute import-hive.sh

I am getting the error below while running import-hive.sh. Could you please help me out on this? hadoop@0.0.0.0:~/apache-atlas-2.1.0/hook/apache-atlas-hive-hook-2.1.0/hook-bin$ ./import-hive.sh Using Hive configuration directory…
venkat
  • 1
0
votes
1 answer

spline spark agent jar has errors during post processing

I have been trying to run the following code with the new spline jar, za.co.absa.spline.agent.spark:spark-3.0-spline-agent-bundle_2.12:0.6.0, but have been getting errors specific to UserExtraMetadataProvider, which has been deprecated in the newer…
0
votes
1 answer

Not able to get desired search results in ElasticSearch search api

I have a field "xyz" on which I want to search. The type of the field is keyword. The different values of the field "xyz" are: a/b/c/d, a/b/c/e, a/b/f/g, a/b/f/h. Now for the following query - { "query": { "query_string" : { …
0
votes
1 answer

How can i insert list of Classifications into Apache Atlas using API

I am trying to insert a set of classifications or tags into Apache Atlas using the API, but I am facing an error. Can anyone help me with a workaround for this error so that I can insert the classifications? My code: import requests import json from…
usr_lal123
  • 650
  • 12
  • 28
0
votes
1 answer

Apache Atlas: curl: (7) Failed to connect to localhost port 21000: Connection refused

I'm trying to run Apache Atlas on my local machine and have run into several problems. First, to clarify how I built Apache Atlas, I will describe the steps: git clone https://github.com/apache/atlas cd atlas mvn clean install…
Mostafa Ghadimi
  • 5,883
  • 8
  • 64
  • 102
0
votes
1 answer

can't build Apache Atlas

I'm trying to build Apache Atlas from the main repository. As it is described in the README.md file, after cloning the repository and changing the current directory to atlas, I am trying to build using mvn clean install command. Unfortunately, since…
Mostafa Ghadimi
  • 5,883
  • 8
  • 64
  • 102
0
votes
1 answer

Access to the creation of Glossary, Terms, Category, and Classification by different types of users. Apache Atlas

Please tell me whether all types of users (e.g., admin, data scientist...) can create the Glossary, Term, Category, and Classification in Apache Atlas. I would also like to know whether it's possible to make any restrictions for some users for these…
IgorB
  • 25
  • 4
0
votes
2 answers

Unable to find other entity-type in Apache Atlas . Only Showing hdfs_path

Hi, I am new to Apache Atlas and I am facing a problem. I want to create a hive_table entity type manually, but the Entity type drop-down shows only "hdfs_path". Can anyone let me know how I can use a custom entity type in Apache Atlas? And can…
0
votes
1 answer

In Apache Atlas, is there a way to delete all the entities?

While debugging some integrations with Apache Atlas, it gets very messy (e.g. a lot of almost identical entities with only slightly different names). Is there a way to delete all of them without deleting the whole graph database? Because in that…
Albert Bikeev
  • 375
  • 6
  • 16
0
votes
1 answer

How to install apache atlas on Azure HDInsight?

I tried adding Apache infra in the Ambari interface, but the Add Service link is not responding.
Ayushi
  • 13
  • 3
0
votes
1 answer

Load properties file in Spark classpath during spark-submit execution

I'm installing the Spark Atlas Connector in a spark-submit script (https://github.com/hortonworks-spark/spark-atlas-connector). Due to security restrictions, I can't put the atlas-application.properties in the spark/conf repository. I used the two…
ttan
  • 1
0
votes
3 answers

No sample data added to Apache Atlas Server: running apache quick_start.py

I have installed Apache Atlas with embedded-hbase-solr on RHEL. I am able to access http://localhost:21000, but when I run /apache-atlas-sources-2.0.0/distro/target/apache-atlas-2.0.0/bin/quick_start.py it throws the error below. log4j:WARN No such…
Vidya
  • 154
  • 1
  • 17