Best unofficial Apache Server developers community
Username
Forgot password?
Sign in with Twitter account
Sign in with Facebook account
List archives

Hadoop Pig users Recent threads

Threads Replies First post Last post
Apache Pig bylaws
By: Alan Gates
As directed in our vote to become a TLP, we (Pig's PMC) need to set out bylaws for the project. I have put up a first proposal for these by laws at http://wiki.apache.org/pig/ProposedByLaws. Please take a look and give feedback. Alan.
7 Sep 27 2010
20:19
Oct 1 2010
19:01
Grouping & Counting
By: Rob Wilkerson
Hey guys - I have a script that loads a list of ~800,000 category hierarchies, filters them a bit and streams them through a PHP script for some quick procedural work. The file contains one column and a snippet looks like this: Arts…
7 Oct 1 2010
06:33
Oct 1 2010
11:57
how to use option mapred.max.map.failures.percent ?
By: Wojciech Langiewicz
Hello, I'm running very big MR job with Pig, and sometimes some maps fail, but I would like this job to finish anyway. I know that option "mapred.max.map.failures.percent" is what I need, but how to pass this option when running Pig? (from…
2 Sep 28 2010
03:50
Sep 30 2010
19:39
funny error
By: hc busy
Guys, I'm seeing this one 2998 Unexpected internal error. Can we be more specific or dump a stack trace when this happens?
7 Sep 29 2010
22:10
Sep 30 2010
18:13
Re: Problem with LzoTokenizedLoader with elephant-bird branch for Pig 0.7
By: Rohan Rai
Hi Which Hadoop/ PIg version are you using ?? Regards Rohan ed wrote: > Hello, > > I tested the newest push to the hirohanin elephant-bird branch (for pig 0.7) > and had an error when trying to use LzoTokenizedLoader with the…
4 Sep 29 2010
22:57
Sep 30 2010
16:01
pig speed in local mode
By: Konstantin Ignatyev
Hi, I am trying to write pig script that is quite complex so I am testing it against very small data subset in local mode. However it might take up to 2 _minutes_ to finish. Or 30 seconds if I execute only parts of it. That is quite annoying to…
3 Sep 30 2010
15:09
Sep 30 2010
15:56
Magic numbers in my pig scripts
By: Eric Wadsworth
Hi folks! I'm brand new to this list, so apologies if this is an inappropriate newbie question, or is otherwise incorrect, but here goes. I'm working with a bunch of pig scripts, and we're adding new ones almost daily. They are getting more and…
7 Sep 29 2010
12:01
Sep 30 2010
15:31
Re: Accessing Nested Json
By: hc busy
hooray! On Wed, Sep 29, 2010 at 4:24 PM, Alan Gates <ga### @yahoo-inc.com> wrote: > > On Sep 29, 2010, at 3:46 PM, hc busy wrote: > > I thought map can only take bytearray as value type? >> > > No, it can take any…
0 Sep 29 2010
22:09
Sep 29 2010
22:09
Accessing Nested Json
By: rakesh kothari
Hi, Is there a good way to access nested properties that are multilevel deep from Json objects loaded in Pig ? For example, if my json is like: {"keyA":{"pA":"vA"}} and I need to access "pA". Thanks, -Rakesh
1 Sep 28 2010
14:45
Sep 29 2010
15:54
project on pigerry
By: Yoomeosympli
Kindly give a set of project on the above, for a degree course
1 Sep 28 2010
14:56
Sep 29 2010
15:52
Streaming to PHP
By: Rob Wilkerson
I have a Pig script--currently running in local mode--that processes a huge file containing a list of categories: /root/level1/level2/level3 /root/level1/level2/level3/level4 ... I need to insert each of these into an existing…
1 Sep 29 2010
07:16
Sep 29 2010
11:30
(PigJsonLoader) how to read/load json with Pig?
By: Benny Sadeh
loading/reading json for Pig processing sounds like a common useful functionality. however, I have not found any implementation for such. (and yes, I know of Elephant Bird, which reads LZO-compressed json (but not regular json)) but I did see a…
2 Sep 28 2010
11:00
Sep 28 2010
22:53
Problem with LzoTokenizedLoader with elephant-bird branch for Pig 0.7
By: pig
Hello, After getting all the errors to go away with LZO libraries not being found and missing jar files for elephant-bird I've run into a new problem when using the elephant-bird branch for pig 0.7 The following simple pig script works as…
10 Sep 23 2010
08:50
Sep 28 2010
13:11
Re: help : error run pig
By: Jeff Zhang
But it's weird that the port in your error is 54310 while in your configuration file it is 9000 On Tue, Sep 28, 2010 at 10:22 AM, Ngô V?n V? <ngovi.### @gmail.com> wrote: > "192.168.230.130" is IP of my machine > @JeffZhang: can you…
1 Sep 27 2010
21:42
Sep 27 2010
21:43
BytesWriteable support in Piggybank SequenceFileLoader?
By: Zach Bailey
Hey folks, Not sure if this has been discussed already or if this is due to some limitation in pig, hadoop, or java - but is there a particular reason the PiggyBank SequenceFileLoader doesn't support the BytesWritable type for sequence file…
3 Sep 27 2010
15:30
Sep 27 2010
19:11
NOT matches
By: Russell Jurney
How do you filter a relation by a field NOT matching a regex? You would think this would work, but it does not: B = FILTER A BY field_foo NOT matches 'test' Russ
4 Sep 26 2010
12:50
Sep 27 2010
13:43
Is there a workaround to "Projection map of new predecessor is null" ??
By: hc busy
Hi guys, I wanted to check if anybody has fixed this error and recall how to fix it? 2010-09-21 14:43:46,288 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2160: Error during fixing projections. Projection map of new predecessor is null. …
2 Sep 21 2010
16:53
Sep 27 2010
11:22
Best way to work on Cassandra Data?
By: Christian Decker
It's been some while since I started using Cassandra in combination with Pig, but I still haven't figured out the best way to work with the data. I wrote some Index Readers based on the format that the contributed CassandraStorage introduced (a…
1 Sep 26 2010
10:47
Sep 27 2010
08:40
Announcement: CCGrid 2011: Call for Papers
By: Viraj Bhat
Pig Users,
0 Sep 26 2010
17:25
Sep 26 2010
17:25
PigPen 'Could not open the editor' Message
By: Matt Tanquary
I just dropped the plugin in (Eclipse Galileo on Suse Linux), and tried to open a pig doc and get this error: Could not open the editor: The editor class could not be instantiated. This usually indicates a missing no-arg constructor or that the…
1 Sep 24 2010
16:46
Sep 24 2010
16:55