Oct
29
2015
SQL // Hadoop

Uniqueidentifier data type in SQL Server not supported in Hive

Recently, while using Sqoop to pull data into Hadoop from MS SQL Server, I found an issue with a table whose primary key was a uniqueidentifer column (GUID).  The problem was nicely documented on StackOverflow here.

As a result I’ve taken the approach of designing the data load routines in Java to alternate between using –split-by and –num-mappers depending the table schemas and it is working beautifully.

Calendar

<<  May 2017  >>
MoTuWeThFrSaSu
24252627282930
1234567
891011121314
15161718192021
22232425262728
2930311234

View posts in large calendar

Page List

    RecentComments

    None