Oct
29
2015
SQL // Hadoop

Uniqueidentifier data type in SQL Server not supported in Hive

Recently, while using Sqoop to pull data into Hadoop from MS SQL Server, I found an issue with a table whose primary key was a uniqueidentifer column (GUID).  The problem was nicely documented on StackOverflow here.

As a result I’ve taken the approach of designing the data load routines in Java to alternate between using –split-by and –num-mappers depending the table schemas and it is working beautifully.

Add comment

biuquote
  • Comment
  • Preview
Loading

Calendar

<<  December 2017  >>
MoTuWeThFrSaSu
27282930123
45678910
11121314151617
18192021222324
25262728293031
1234567

View posts in large calendar

Page List

    RecentComments

    None