Category Archives: General

Apache Pig and Distributed Cache

Often while building UDFs for Apache Pig, you need to copy files/libraries to all mapper or reducers so that they can be accessed via your UDFs. In this post, I show how to copy files/archives to distributed cache and create a symbolic link to them so that they can accessed through the UDF. Continue reading

Rate this:

Posted in General, Hadoop, Programming | Tagged , | 5 Comments