
I need to pull a Google Sheet through Cloud Data Fusion.

There is some documentation, but it does not give a practical example of how to configure the plugin fields.

Currently I get a 403 Forbidden error on the Directory Identifier and service account fields during validation, even though all access has been granted.

Wytrzymały Wiktor
RaptorX
  • I updated the parameters and ran validation. The Directory Identifier field gives the following error, even though the Google Sheet is publicly visible: 403 Forbidden { "code" : 403, "errors" : [ { "domain" : "global", "message" : "Insufficient Permission: Request had insufficient authentication scopes.", "reason" : "insufficientPermissions" } ], "message" : "Insufficient Permission: Request had insufficient authentication scopes." } Provide an existing folder identifier. – RaptorX May 20 '21 at 16:12
  • Please add all additional information to your question. It will be more visible there than in a comment. Could you describe which steps you followed and when you received this error? – PjoterS May 21 '21 at 15:35
  • @PjoterS The Google Sheets source plugin has the following input parameters (listing the ones I am unsure about): Directory Identifier, Filter, Authentication Type, Metadata Extraction. Directory Identifier: I inserted the key from the Google Drive folder URL (do we need to publish the directory or file?). Authentication Type: Service Account (the file itself is unrestricted). Filter: I gave the key from the actual file URL. Metadata File Name: not sure; I want the schema columns to be the headers in the first row of the sheet. The error above appears on validation for the Directory Identifier and service account fields. – RaptorX May 23 '21 at 07:13

1 Answer


The error you mention is related to permissions. The service account needs permission to read from the Google Sheet and then to write to BigQuery. In this kind of setup, the sheet must be shared with the service account or the user that accesses it.
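The error text quoted in the comments mentions authentication scopes rather than sharing, so it may help to tell the two cases apart before changing anything. Below is a minimal sketch, assuming the error body has the shape shown in the question; `diagnose_403` is a hypothetical helper for illustration, not part of Data Fusion or any Google library.

```python
import json

# Error body copied from the validation message quoted in the question.
ERROR_BODY = """
{ "code" : 403,
  "errors" : [ { "domain" : "global",
                 "message" : "Insufficient Permission: Request had insufficient authentication scopes.",
                 "reason" : "insufficientPermissions" } ],
  "message" : "Insufficient Permission: Request had insufficient authentication scopes." }
"""


def diagnose_403(body: str) -> str:
    """Classify a Google API 403 error body (hypothetical helper).

    Returns "scopes" when the message complains about authentication
    scopes, otherwise "sharing" as a fallback guess.
    """
    error = json.loads(body)
    message = error.get("message", "")
    if "insufficient authentication scopes" in message.lower():
        return "scopes"
    return "sharing"


print(diagnose_403(ERROR_BODY))
```

If the result is "scopes", the credentials were probably issued without a Drive or Sheets OAuth scope, and sharing the sheet more widely is unlikely to change the outcome.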

Since you probably cannot share the whole configuration, I suggest checking the Cloud Data Fusion access control documentation to set the permissions you need.

Also, please be aware that improvements to the CDAP Google Drive and Google Sheets plugins are still in progress; they can be found here.
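The comments describe copying the key from the Drive folder URL into Directory Identifier and the key from the file URL into the filter. A small sketch of pulling those keys out of the URLs follows, assuming the usual `.../folders/<id>` and `.../d/<id>/...` URL shapes; `drive_id` and the placeholder IDs are hypothetical, and whether the plugin accepts these values as-is is not confirmed here.

```python
import re
from typing import Optional

# Assumed URL shapes: Drive folders use /folders/<id>, Sheets files use /d/<id>/.
FOLDER_RE = re.compile(r"/folders/([A-Za-z0-9_-]+)")
FILE_RE = re.compile(r"/d/([A-Za-z0-9_-]+)")


def drive_id(url: str) -> Optional[str]:
    """Return the folder or file ID embedded in a Drive/Sheets URL, if any."""
    for pattern in (FOLDER_RE, FILE_RE):
        match = pattern.search(url)
        if match:
            return match.group(1)
    return None


# Placeholder IDs, not real resources.
folder_url = "https://drive.google.com/drive/folders/FOLDER_ID_PLACEHOLDER?usp=sharing"
sheet_url = "https://docs.google.com/spreadsheets/d/SHEET_ID_PLACEHOLDER/edit#gid=0"

print(drive_id(folder_url))
print(drive_id(sheet_url))
```

Pasting a full URL, or an ID with a trailing query string, into an identifier field is an easy mistake to make, so extracting the bare ID first removes one variable when debugging validation errors.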

PjoterS
  • Permissions have been granted on the sheet to the Data Fusion service account and to the Compute Engine service account required to run the Data Fusion job, and the sheet is also publicly shared so that access requirements are minimal. I presume the plugin needs some specific inputs and that this is causing the error listed above, but I cannot find any write-up on how to use it. – RaptorX Jun 01 '21 at 16:03
  • Did you provide the service account file path? – Minesh Barot Feb 12 '22 at 01:22