Questions tagged [spotify-scio]

Scio is a Scala API for Google Cloud Dataflow and Apache Beam inspired by Spark and Scalding.

80 questions
0
votes
1 answer

Scala macros: string literal assigned to a variable does not match

I am using a macro annotation from Spotify's Scio library. I would like to define a variable of String type and annotate like this: val schemaString = """schema here""" @BigQueryType.fromSchema(outputString) class BigQuery This does not compile,…
jamborta
  • 5,130
  • 6
  • 35
  • 55
0
votes
0 answers

Scio: How can I combine messages sent from cloud pub sub using Apache Beam?

I am currently using Apache Beam's scala wrapper library, scio. The thing you want to do is combine different types of messages sent from CloudPubSub based on ID. The message A is sent every second, and the message B is sent once every three…
0
votes
2 answers

SBT Verify Error caused by multiple protobuf 2/3 dependencies in spite of shading

I am struggling with Verify Errors with this below sample project using Scio/Bigtable/HBase. The dependency tree requires protobuf version (2.5, 2.6.1, 3.0, 3.1) and seems to default to 3.2. I used the shading component of sbt-assembly, not sure I…
ogen
  • 802
  • 2
  • 7
  • 23
0
votes
1 answer

Transform HBase Scan to RowFilter

I'm using scio from spotify for my Dataflow jobs. In last scio version, new bigtable java api is used (com.google.bigtable.v2) Now scio bigtable entry point required "RowFilter" to filter instead of Hbase "Scan". Is there a simple way to transform…
harscoet
  • 340
  • 1
  • 11
0
votes
2 answers

How to save SCollection elements to different BigQuery tables?

I need to save SCollection elements into different hourly BigQuery tables on the basis of their timestamps. I tried the following methods - Group elements by (TableName, Iterable[TableRow]) and then save each Iterable[TableRow] to their respective…
Kakaji
  • 1,421
  • 2
  • 15
  • 23
1 2 3 4 5
6