Questions tagged [palantir-foundry]

Palantir Foundry is a web-based data analytics and decision modeling SaaS platform. Use this tag for questions about building your own models in Foundry using Python, R, or SQL or working with the Foundry API.

Palantir Foundry is a web-based data analytics and decision modeling SaaS platform. Use this tag for questions about building your own models in Foundry using Python, R, or SQL or working with the Foundry API.

731 questions
3
votes
1 answer

TODAY() Function in Foundry Fusion Sheets

I am pretty new to Fusion Sheets and want to prepare some data. As it seems, there is no TODAY() function existing as far as I could tell. So I was thinking of creating a custom function in order to do so. But the documentation is not giving any…
Eiko
  • 31
  • 1
3
votes
2 answers

What are best practices for minimizing CI Check times within a Code Repository?

Are there things I can do to reduce the amount of time it takes for Code Repo checks to run?
3
votes
1 answer

How to have flexible grouping column in Foundry Functions?

In my Workshop application, I would like to have a bar chart with a changeable x-axis. A dropdown widget would be used to select the desired x-axis. For this I am writing a TypeScript function, that will return the data that will feed the chart…
Eduard Jesko
  • 198
  • 1
  • 1
  • 7
3
votes
3 answers

Palantir Foundry How to allow dynamic number of input in compute (Code repository)

I have a folder where I will upload one file every month. The file will have the same format in every month. First problem The idea is to concatenate all the files in this folder into one file. Currently I am hardcoding the filenames (filename[0],…
infinity911
  • 201
  • 1
  • 9
3
votes
1 answer

Shuffle Stage Failing Due To Executor Loss

I get the following error when my spark jobs fails **"org.apache.spark.shuffle.FetchFailedException: The relative remote executor(Id: 21), which maintains the block data to fetch is dead."** Over view of my spark job input size is ~35 GB I have…
3
votes
1 answer

How do I know my Foundry Job is using AQE?

I hear people mention this AQE feature sometimes and I'm wondering how to verify if my job is using it or not. I'm running transformations both in Code Repositories and Code Workbooks.
3
votes
1 answer

How can I use sqlContext (to execute SQL queries) in the Python transform?

I have done the following in Code Repositories @transform_df( Output(test_dataset_path), df=Input(og_dataset_path) ) def compute(ctx, df): ctx.spark_session.sql(f''' CREATE TABLE `test_dataset_path` AS SELECT * FROM…
3
votes
2 answers

Can I put a schedule on the build rather than the dataset?

I want to be able to generate a new dataset with each build, where the current date is appended to the name, like so: dataset_output_2021-11-27 dataset_output_2021-11-28 dataset_output_2021-11-29 Is it possible to put a schedule on the build rather…
3
votes
1 answer

How to get complete metadata of dataset in Palantir Foundry through API call?

I want to fetch complete metadata of the given dataset through API call. Can anyone please suggest how to fetch metadata
3
votes
1 answer

How can I merge an incremental dataset and a snapshot dataset while retaining deleted rows?

I have a data connection source that creates two datasets: Dataset X (Snapshot) Dataset Y (Incremental) The two datasets pull from the same source. Dataset X consists of the current state of all rows in the source table. Dataset Y pulls all rows…
3
votes
1 answer

How do I union two datasets in Palantir Foundry within a code workbook?

I need to UNION two datasets in a Code Workbook of Palantir Foundry and I'm not sure how to do that. I want to use Pyspark to do this. I'm new to Foundry, please help!
3
votes
1 answer

Palantir Foundry incremental testing is hard to iterate on, how do I find bugs faster?

I have a pipeline setup in my Foundry instance that is using incremental computation but for some reason isn't doing what I expect. Namely, I want to read the previous output of my transform and get the maximum value of a date, then read the input…
3
votes
2 answers

Cannot find module '@foundry/ontology-api' or its corresponding type declarations

When opening a branch in a Typescript repository, the following error appears against one of the modules @foundry/ontology-api in the import statement: [typescript] Cannot find module '@foundry/ontology-api' or its corresponding type…
ivor
  • 136
  • 3
  • 8
3
votes
1 answer

orderBy and sort is not applied on the full dataframe

The final result is sorted on column 'timestamp'. I have two scripts which only differ in one value provided to the column 'record_status' ('old' vs. 'older'). As data is sorted on column 'timestamp', the resulting order should be identic. However,…
3
votes
2 answers

How to distribute N rows into X groups and to attribute a value D in PySpark?

what I want to do : In PySpark, I am trying to distribute N rows into X groups of same size and to attribute a specific value D to those groups. Each row consist of attributes A, B, C (reference, item, location) where all A are unique, but not B &…
ionah
  • 81
  • 1
  • 8