Theres a data set of size 200M how to get random sample data(of size 100rows) efficiently using SQLalchemy or any other possible way.
Asked
Active
Viewed 1,056 times
1
-
filter for 1 partition. otherwise you can't because redshift is columnar – drum Jul 27 '22 at 13:47
-
please post the schema of the table. – damjad Jul 27 '22 at 14:23
-
How is this different to your [previous question](https://stackoverflow.com/questions/73084062/is-there-any-way-to-select-rows-from-a-table-without-using-offset-or-orderby)? – John Rotenstein Jul 28 '22 at 01:26
-
2Use `ORDER BY RANDOM() LIMIT 100` – John Rotenstein Jul 28 '22 at 01:27
-
Please provide enough code so others can better understand or reproduce the problem. – Community Jul 28 '22 at 03:39