Always keep the larger table on the right side of joins
When doing JOINs on HIVE, remember to keep the larger size table on the right of the expression, that'll make it be streamed through the reducers, instead of being cached in memory (as does the left side table) avoiding that horrible Heap Space issue that you'll run into.
Written by Fernando Doglio
Related protips
Have a fresh tip? Share with Coderwall community!
Post
Post a tip
Best
#Hadoop
Authors
devtripper
37.79K
kh1ramatsu
8.992K
Sponsored by #native_company# — Learn More
#native_title#
#native_desc#