Questions tagged [u-sql]

U-SQL is a query language designed for Azure Data Lake. It combines SQL keywords with C# expressions, so that within a single script a programmer can schematize the data from an unstructured source, use SQL to aggregate the data into the desired form, and then write the output to a file or table.
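As a rough illustration of that extract, aggregate, and output flow, here is a minimal sketch. The file paths, column names, and rowset names are hypothetical and chosen only for the example; they do not come from any question on this page.

```usql
// Hypothetical input: a CSV file with a name column and an amount column.
@rows =
    EXTRACT name string,
            amount int
    FROM "/input/sample.csv"
    USING Extractors.Csv();

// C# expressions (ToUpperInvariant, the > comparison) mixed with SQL keywords.
@cleaned =
    SELECT name.ToUpperInvariant() AS name,
           amount
    FROM @rows
    WHERE amount > 0;

// Plain SQL-style aggregation over the schematized rowset.
@totals =
    SELECT name,
           SUM(amount) AS total
    FROM @cleaned
    GROUP BY name;

// Write the result to a file (hypothetical output path).
OUTPUT @totals
TO "/output/totals.csv"
USING Outputters.Csv();
```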

771 questions
3
votes
2 answers

How can I log something in a U-SQL UDO?

I have a custom extractor, and I'm trying to log some messages from it. I've tried obvious things like Console.WriteLine, but cannot find where the output is. However, I found some system logs in…
arghtype
3
votes
3 answers

U-SQL Parallel reading from SQL Table

I have a scenario in which I am ingesting data from an MS SQL DB into Azure Data Lake using U-SQL. My table is quite big, with over 16 million records (soon it will be much more). I just do a SELECT a, b, c FROM dbo.myTable; I realized, however,…
candidson
3
votes
4 answers

Generate dates in a date range using U-SQL

I need to populate a rowset with all the dates between a defined Start date and End Date. If my start date is 19/7/2017 and the end date is 21/7/2017 then the rowset should contain 19/7/2017, 20/7/2017 and 21/7/2017. I was wondering if there was an…
3
votes
1 answer

U-SQL - is it possible to reference a registered assembly from a code-behind file?

Registered assemblies can be used in U-SQL script files by using REFERENCE ASSEMBLY. Is it possible to reference a registered assembly in a C# code-behind file?
Liz Nett
3
votes
1 answer

Value too long failure when attempting to convert column data

Scenario: I have a source file that contains a block of JSON on each new line. I then have a simple U-SQL extract as follows where [RawString] represents each new line in the file and the [FileName] is defined as a variable from the @SourceFile…
Paul Andrew
3
votes
1 answer

Cannot access Azure Key Vault from Azure Data Lake Analytics

I have a U-SQL script with a custom extractor, which accesses Azure Key Vault to get some credentials. I followed this tutorial, and I have equivalent code to get a token from AD and then to call the provided URI for the actual credentials: public static async…
arghtype
3
votes
1 answer

U-SQL job performance

Could you help me with the job performance? I ran it with 10 AUs. During the first part of the run almost all of them are used, but in the second half of the execution time it uses only 1 AU. I see in the plan that one supervertex consists of only one…
churupaha
3
votes
2 answers

How to merge base and multiple delta structured streams in Azure Data Lake

I have a base stream and multiple delta streams for different consecutive dates. I want to merge them to get the final stream. How can I accomplish this in Azure Data Lake? For example, assume these are the streams. I need to merge these streams…
user2580488
3
votes
2 answers

U-SQL Split a CSV file to multiple files based on Distinct values in file

I have data in Azure Data Lake Store and I am processing it with an Azure Data Lake Analytics job using U-SQL. I have several CSV files which contain spatial data, similar to this: File_20170301.csv longtitude| lattitude | date …
FeodorG
3
votes
1 answer

U-SQL: direct output to SQL DB

Is there a way to output U-SQL results directly to a SQL DB such as Azure SQL DB? I couldn't find much about that. Thanks!
candidson
3
votes
2 answers

Does the ROWCOUNT hint work for EXTRACT in U-SQL?

I want to allocate more vertices to the extraction job. I tried using the ROWCOUNT hint, but it doesn't seem to work: no matter what value I use for ROWCOUNT, U-SQL always allocates the same number of vertices. EXTRACT xxxx FROM @"Path" USING new…
lidong
3
votes
1 answer

U-SQL job is very slow when I add a .NET call

The code runs very fast over 2000 small files (~10-50 KB), in ~1 min, with Parallelism = 5. @arenaData = EXTRACT col1, col2, col3 FROM @in USING Extractors.Tsv(quoting : true, skipFirstNRows : 1, nullEscape : "\\N",…
churupaha
3
votes
1 answer

What do priority and parallelism value mean in Azure Data Lakes (Hadoop)?

In other words, what does a parallelism value of 5 and a priority value of 1000 mean?
Justin Borromeo
3
votes
1 answer

Bad Request.,Source=Microsoft.WindowsAzure.Storage,StorageExtendedMessage=Block blobs are not supported

I am trying to create an Azure data factory by following the article "Create, monitor, and manage Azure data factories using Data Factory .NET SDK". Now I am getting the following error: Copy activity encountered a user error at Sink side:…
Kishan Gupta
3
votes
1 answer

Is it possible to use U-SQL managed tables as output datasets in Azure Data Factory?

I have a small ADF pipeline that copies a series of files from an Azure Storage Account to an Azure Data Lake account. As a final activity in the pipeline I want to run a U-SQL script that uses the copied files as inputs and outputs the result to a…
soderstromOlov