Oct 29, 2015
SQL // Hadoop

Uniqueidentifier data type in SQL Server not supported in Hive

Recently, while using Sqoop to pull data into Hadoop from MS SQL Server, I found an issue with a table whose primary key was a uniqueidentifier column (GUID). The problem was nicely documented on Stack Overflow here.

As a result, I’ve taken the approach of designing the data load routines in Java to alternate between using --split-by and --num-mappers depending on the table schemas, and it is working beautifully.
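For illustration, here is a minimal sketch of how that switch might look in Java. It is not the actual load routine: the class name, the `buildImportArgs` helper, and the rule "fall back to a single mapper when the key is a uniqueidentifier" are all assumptions made for the example. Only the Sqoop options `--table`, `--split-by`, and `--num-mappers` are real; connection options are left out.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper: picks Sqoop import arguments based on the key column type.
class SqoopArgsSketch {

    // Assumption: key types we do not want Sqoop to split on.
    private static final Set<String> UNSPLITTABLE_TYPES = Set.of("uniqueidentifier");

    static List<String> buildImportArgs(String table, String keyColumn, String keyType) {
        List<String> args = new ArrayList<>();
        args.add("import");
        args.add("--table");
        args.add(table);

        boolean splittable = keyType != null
                && !UNSPLITTABLE_TYPES.contains(keyType.toLowerCase(Locale.ROOT));

        if (splittable) {
            // Let Sqoop parallelize the import by splitting on the key column.
            args.add("--split-by");
            args.add(keyColumn);
        } else {
            // GUID (or unknown) key: use one mapper so no split column is needed.
            args.add("--num-mappers");
            args.add("1");
        }
        return args;
    }

    public static void main(String[] unused) {
        System.out.println(buildImportArgs("Orders", "OrderId", "int"));
        System.out.println(buildImportArgs("Customers", "CustomerGuid", "uniqueidentifier"));
    }
}
```

The key type would come from the table schema, for example from JDBC metadata. A single mapper gives up parallelism, so for large GUID-keyed tables it may be worth splitting on a different, evenly distributed column instead.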
