
I have data collected from different devices A, B, and C. Each device records its data in the following format:

Table 1 from device A:
Longitude  Latitude Feature1 Feature2 Feature3
XX.xxx     XX.xxx   10.00    20.00    30.00   
---
many rows
Table 2 from device B:
Longitude  Latitude Feature3 Feature4 Feature5
XX.xxx     XX.xxx   1.00     2.00     3.00
---
many rows
Table 3 from device C:
Longitude  Latitude FeatureX FeatureY
XX.xxx     XX.xxx   5        6
---
many rows

How can I build a single table in which each row combines the features recorded by every device at the nearest location, so that I can run a regression on those features? R or Python preferred. Thanks.

Feature1 Feature2 Feature3 Feature3 Feature4 Feature5 FeatureX FeatureY
10.00    20.00    30.00    1.00     2.00     3.00     5        6
---
Many Rows 
user438383
Wenyao Leo
    Please share the code you have written so far. – ss3387 Sep 06 '22 at 17:50
    Currently, I do it manually on small data samples in Excel, using formulas to find the rows with the smallest distance between GPS coordinates. That does not scale to large datasets. – Wenyao Leo Sep 06 '22 at 20:47

1 Answer


The sjoin_nearest function of GeoPandas looks like a good fit: it joins each row of one table to the spatially nearest row of another.
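
As a rough illustration of how sjoin_nearest could be applied to the three device tables, here is a minimal sketch. The sample values, the column names Longitude and Latitude, the helper to_points, and the choice of EPSG:3857 as a metre-based CRS are all assumptions made for the example; the coordinates are assumed to be WGS 84 degrees, and a local projection (for instance the matching UTM zone) would likely give more accurate distances.

```python
import geopandas as gpd
import pandas as pd

COORDS = ["Longitude", "Latitude"]  # assumed column names


def to_points(df, metric_crs="EPSG:3857"):
    """Hypothetical helper: lon/lat table -> GeoDataFrame in a metre-based CRS."""
    gdf = gpd.GeoDataFrame(
        df.drop(columns=COORDS),
        geometry=gpd.points_from_xy(df["Longitude"], df["Latitude"]),
        crs="EPSG:4326",  # assumption: coordinates are WGS 84 degrees
    )
    return gdf.to_crs(metric_crs)


# Made-up sample rows standing in for the three device tables.
a = pd.DataFrame({"Longitude": [10.000, 10.100], "Latitude": [50.000, 50.100],
                  "Feature1": [10.0, 11.0], "Feature2": [20.0, 21.0],
                  "Feature3": [30.0, 31.0]})
b = pd.DataFrame({"Longitude": [10.101, 10.001], "Latitude": [50.101, 50.001],
                  "Feature3": [1.0, 1.5], "Feature4": [2.0, 2.5],
                  "Feature5": [3.0, 3.5]})
c = pd.DataFrame({"Longitude": [10.002, 10.099], "Latitude": [49.999, 50.099],
                  "FeatureX": [5, 7], "FeatureY": [6, 8]})

ga, gb, gc = to_points(a), to_points(b), to_points(c)

# For every row of A, attach the nearest row of B. Columns present in both
# tables (Feature3 here) are disambiguated with the suffixes.
merged = gpd.sjoin_nearest(ga, gb, how="left", lsuffix="A", rsuffix="B",
                           distance_col="dist_B")
# Drop the helper column holding the matched B index before the next join.
merged = merged.drop(columns=["index_B"], errors="ignore")

# Repeat against C, again keeping every row of A.
merged = gpd.sjoin_nearest(merged, gc, how="left", lsuffix="A", rsuffix="C",
                           distance_col="dist_C")
merged = merged.drop(columns=["index_C"], errors="ignore")

# Plain table of features, ready for regression.
table = pd.DataFrame(merged.drop(columns="geometry"))
print(table)
```

Device A is used as the reference table here, so the result has one row per A location. If two candidate rows are exactly equidistant, sjoin_nearest returns one output row per tie, so it may be worth checking for duplicated index values. The dist_B and dist_C columns hold the match distances in CRS units and can be used to discard matches that are too far apart; the max_distance parameter serves the same purpose at join time.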

Ture Pålsson
  • Thank you. I found a very helpful tutorial that solved my problem. https://autogis-site.readthedocs.io/en/latest/notebooks/L3/06_nearest-neighbor-faster.html – Wenyao Leo Sep 20 '22 at 16:03
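
For small samples, the manual Excel approach mentioned in the comments can be reproduced without any GIS library. The sketch below is only an illustration, not the method from the linked tutorial: it compares every pair of rows using the haversine great-circle distance, which is quadratic in the number of rows and therefore mainly useful for sanity-checking results on a subset. The function names, the column names Longitude and Latitude, and the sample values are assumptions.

```python
import math

COORDS = ("Longitude", "Latitude")  # assumed column names


def haversine_km(lon1, lat1, lon2, lat2):
    """Great-circle distance in kilometres between two lon/lat points in degrees."""
    radius_km = 6371.0  # mean Earth radius
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    h = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * radius_km * math.asin(math.sqrt(h))


def nearest_row(point, rows):
    """Return the row in `rows` closest to `point` (brute-force scan)."""
    return min(
        rows,
        key=lambda r: haversine_km(point["Longitude"], point["Latitude"],
                                   r["Longitude"], r["Latitude"]),
    )


def merge_nearest(base_rows, other_tables):
    """For each base row, append the features of the nearest row of each other table."""
    merged = []
    for row in base_rows:
        out = {k: v for k, v in row.items() if k not in COORDS}
        for name, rows in other_tables.items():
            match = nearest_row(row, rows)
            for key, value in match.items():
                if key in COORDS:
                    continue
                # Suffix a column with the table name if it already exists.
                out[key if key not in out else f"{key}_{name}"] = value
        merged.append(out)
    return merged


# Made-up sample rows standing in for the device tables.
table_a = [{"Longitude": 10.000, "Latitude": 50.000,
            "Feature1": 10.0, "Feature2": 20.0, "Feature3": 30.0}]
table_b = [{"Longitude": 10.101, "Latitude": 50.101,
            "Feature3": 1.0, "Feature4": 2.0, "Feature5": 3.0},
           {"Longitude": 10.001, "Latitude": 50.001,
            "Feature3": 1.5, "Feature4": 2.5, "Feature5": 3.5}]
table_c = [{"Longitude": 10.002, "Latitude": 49.999, "FeatureX": 5, "FeatureY": 6}]

result = merge_nearest(table_a, {"B": table_b, "C": table_c})
print(result[0])
```

For large datasets a spatial index (as used by sjoin_nearest) or a tree-based nearest-neighbor search avoids the pairwise scan.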