Best unofficial Apache Server developers community
Username
Forgot password?
Sign in with Twitter account
Sign in with Facebook account
List archives

Pig Recent threads

Threads Replies First post Last post
in statement
By: Fabian Alenius
Hi, lets say I have a large tuple or a bag and I want to see if one of the fields match a string. How would one do that? Similarly how do you apply a function to all the fields in a tuple? Thanks, Fabian
2 May 25 2012
04:47
May 25 2012
05:29
RCfile
By: yingnan.ma
Hi, I want to use RCfile to address the IO problem, and I can not find some paper about how to install or how to use it by PIG, so if you had some install or configue file, you could share with me. Thank you. Best Regards Malone 2012-05-24 …
3 May 24 2012
01:26
May 25 2012
01:09
UDF FilterFunc and logical OR
By: Johannes Schwenk
Hello List, I am using Clouderas distribution (cdh3u3) which comes with pig-0.8.1. I have written a UDF extending FilterFunc that checks if the provided string is contained within the specified column of the current tuple:…
8 May 21 2012
11:37
May 24 2012
12:16
InstanceAlreadyExistsException on initializing MiniCluster
By: Johannes Schwenk
Hi everybody, I'm trying to run some unit tests for a custom LOAD function that use MiniCluster. I get the following exception when running on pig 0.10.0 : http://pastebin.com/sNJBgJHC For pig 0.8.1 I don't get that error. I have no idea how to…
2 May 24 2012
08:36
May 24 2012
11:57
Diagnostic Operators inside Macros
By: Daniel Duckworth
Hello everyone, I have been trying to debug some macros I've written, but I'm finding that every Diagnostic Operator (describe, dump, explain, and illustrate) are all being recognized as illegal inside a macro. I could find no documentation…
0 May 24 2012
11:52
May 24 2012
11:52
Create rdbms like sequence in Pig on Pig Relation
By: DIPESH KUMAR SINGH
I want to create a rdbms like sequence on a Pig relation. Is there any existing UDF which could do this? I am bit new to pig, Kindly suggest how to proceed? Thanks & Regards,
10 May 16 2012
12:42
May 24 2012
00:55
Writing to rcfile
By: Mohit Anchlia
Could someone please point me to someway where I can store in rcfile format with snappy compression? I need to use this output in hive.
1 May 22 2012
16:14
May 23 2012
16:48
Recommendations for compression
By: rakesh sharma
Hi Guys, I am writing data in hadoop using java client. The source of data for java client is a messaging data. The java client rotates files every 15 minutes. I use PigServer to submit map reduce job on the just closed file. These files have data…
2 May 23 2012
13:15
May 23 2012
14:01
While/CROSS/FOREACH loop
By: Russell Jurney
I need to repeatedly CROSS a data set, then FOREACH it, reduce it with a filter, then group/test it to test if it's done yet, then repeat until it is baked. How do I do that with pig, and maybe some other tool? Twitter has some ruby stuff that can…
0 May 23 2012
01:32
May 23 2012
01:32
why pig did not compile contrib directory during "ant jar"? How to compile contrib directo
By: lulynn_2008
Hi, During generating pig jar files, I found the contrib directory is not compiled. I assume maybe this is because the contrib directory is not for pig mainline functions. Am I right? Besides, if I want to compile the contrib directory, how to do…
7 May 22 2012
22:08
May 23 2012
00:13
: what jar are included in pig-0.10.0.jar and pig-0.10.0-withouthadoop.jar?
By: lulynn_2008
Hi, I found two jar files in pig-0.10.0 package: pig-0.10.0.jar and pig-0.10.0-withouthadoop.jar. I have questions about them: 1. Seems the differences between these two jars are: pig-0.10.0.jar includes all the dependencies of pig, and…
3 May 20 2012
22:03
May 22 2012
22:03
Re: Design issue, need feedback
By: Nerius Landys
> There are a couple of ways that you can do this. One, is that you could > make a special loader that converts your format to a map of (key,value) > pairs, and then you can project however you want. > > Another (better, if at all…
2 May 22 2012
13:30
May 22 2012
15:47
Parse XML file with PIG
By: Herbert Mühlburger
Hi list, I would like to parse the following XML-File using Pig: <page> <id>1</id> <revision> <id>1</id> <username>muehlburger</username> </revision> <revision> …
4 May 17 2012
03:31
May 22 2012
10:36
User survey: Where are all the UDFs and Macros?
By: Russell Jurney
Most Pig UDF development has moved away from Piggybank. I want to document where these UDFs are in a central place. If you know where some cool Pig macros, streaming examples or UDFs are, be they Java, Python, Jython, JRuby, or whatever... please…
5 May 15 2012
20:36
May 21 2012
19:13
Problem loading sequence files with Elephant Bird
By: Chris Diehl
Hi All, I'm attempting to load sequence files for the first using Elephant Bird's sequence file loader and having absolutely no luck. I did a hadoop fs -text one on of the sequence files and noticed all the keys are (null). Not sure if that is…
6 May 16 2012
13:48
May 21 2012
15:40
How to use TOP?
By: Mohammad Tariq
Hello list, I have an Hdfs file that has 6 columns that contain some data stored in an Hbase table.the data looks like this - 18.98 2000 1.21 193.46 2.64 58.17 52.49 2000.5 4.32 947.11 2.74 64.45 115.24 2001 …
3 May 21 2012
06:55
May 21 2012
14:46
Jython Interpretor error
By: Manish Bhoge
Hi All, I have written some UDF in Python script (test.py). Now i want to register this python script with PIG-GRUNT. But when i registering this script i am getting below errors. Case 1: When i place the python script in local directory like…
2 May 19 2012
22:20
May 19 2012
22:43
Python UDF "import RE" bug
By: Saurabh S
Is there any way to bypass the bug listed here: https://issues.apache.org/jira/browse/PIG-1824 Some discussion is here:…
1 May 17 2012
17:24
May 19 2012
12:08
STRSPLIT problems (or UDF shortcoming?)
By: Nerius Landys
I'm having problems using Pig's STRSPLIT (on Amazon's cloud computing environment). I also noticed that STRSPLIT isn't documented in the Pig Latin Reference Manual, so I found out about it through other sources of information. My problem is that…
16 May 17 2012
12:58
May 18 2012
22:00
Load Pig metadata from file?
By: Saurabh S
Here is a sample LOAD statement from Programming Pig book: daily = load 'NYSE_daily' as (exchange:chararray, symbol:chararray, date:chararray, open:float, high:float, low:float, close:float, volume:int, adj_close:float); …
5 May 15 2012
17:35
May 18 2012
12:09