We see many many options to retrieve data from RDBMS and we validate the data every time if we receive it as dump. Say for a use case where the source can give you access to fetch the full data. one can use the JDBC Hive storage Handler to directly query the JDBC RDBMS data source.

Below is the URL for further Mind tweaking.

 

Official:

https://issues.apache.org/jira/browse/HIVE-1555

https://community.hortonworks.com/articles/4671/sparksql-jdbc-federation.html

 

Forks:

 

https://github.com/qubole/Hive-JDBC-Storage-Handler

 

https://github.com/myui/HiveJdbcStorageHandler

 

https://github.com/QubitProducts/hive-jdbc-storage-handler

 

Advertisements