&
Best unofficial Apache Server developers community
Username
Remember me?
Password
Forgot password?
Create an account
List archives
Videos
Answers
Questions
Unanswered
Tags
Ask Question
hadoop
0
votes
1
answers
213
views
Hive Jdbc client on hadoop - getting FileNotFoundException file:/user/hive/warehouse/test does not exist exception
Hi i am trying to run the simple Hive Jdbc client program i am getting FileNotFoundException. But when i try with local file path it's working fine.…
client
hadoop
hive
jdbc
on
asked
February 23, 2012 3:14 am AMT
chandra
0
votes
0
answers
131
views
Caching dynamically generated images in Rails
I am using PhantomJS to dynamically generate 10 large images of websites at a time in each request. Therefore it is important that I cache these…
ruby-on-rails
ruby
caching
hadoop
amazon-web-services
asked
June 25, 2011 2:01 pm CDT
Justin Meltzer
0
votes
0
answers
40
views
Hadoop mapper emits unique key. can I perform reducer after per map?
My mapper emits 'uniq key' - 'very large value' pair. My reducer doesn't know the key is unique. Thus, reducer waits all the mappers are completed.…
hadoop
mapreduce
asked
June 25, 2011 8:29 am CDT
Ted Kim
0
votes
0
answers
52
views
The type Mapper is not generic, hadoop eclipse pluggin
I am using eclipse to write mapreduce program. I imported hadoop library (hadoop-0.13.0-core.jar) I imported Mapper class import…
hadoop
hadoop-plugins
asked
June 24, 2011 5:06 pm CDT
bigbang
1
vote
1
answers
46
views
Computing distinct and common lines of two files with hadoop
Sorry for cross-posting this on the hadoop user mailing list and here, but this is getting an urgent matter for me. My problem is as follows: I have…
hadoop
set-intersection
asked
June 24, 2011 9:22 am CDT
raven_arkadon
0
votes
0
answers
35
views
Possible to use Map Reduce and Hadoop to parallel process batch jobs?
Our organization has hundreds of batch jobs that run overnight. Many of these jobs require 2, 3, 4 hours to complete; some even require up to 7…
hadoop
parallel-processing
mapreduce
asked
June 24, 2011 9:15 am CDT
RaffiM
0
votes
0
answers
39
views
Hibernate Session closings with Hadoop
I'm an intermediate Hibernate user. I am trying to get some traction with Hadoop at my company. I'm using a library called spring-hadoop…
hibernate
spring
hadoop
asked
June 23, 2011 5:08 pm CDT
rajat banerjee
0
votes
0
answers
41
views
Combine MapReduce result with data
How could i combine with map/reduce these two files: File1. Data. 1 name:foo1,position:bar1 2 name:foo2,position:bar2 3 name:foo3,position:bar3 4…
join
hadoop
mapreduce
asked
June 23, 2011 9:10 am CDT
user812366
0
votes
0
answers
49
views
Hadoop & HBase CDH3 Distro on Ubuntu 11.04 - natty
I wanted to know if anyone had success running the CDH3 Cloudera release of Hadoop and HBase on the latest version of Ubuntu - natty 11.04..? I have…
hadoop
hbase
bigtable
cloudera
asked
June 23, 2011 7:51 am CDT
NightWolf
0
votes
0
answers
48
views
Rsync files to hadoop
I have 6 servers and each contains a lot of logs. I'd like to put these logs to hadoop fs via rsync. Now I'm using fuse and rsync writes directly to…
hadoop
rsync
asked
June 23, 2011 1:24 am CDT
Michal
0
votes
0
answers
39
views
Hadoop copyFromLocal problem wit copying directory
I'd like to copy whole local directory with some subdirectories and files to HDFS. HDFS already contains the root directory and some subdirectories…
hadoop
asked
June 23, 2011 12:05 am CDT
Michal
0
votes
1
answer
46
views
No namenode error in pseudo-mode
I'm new to hadoop and is in learning phase. As per Hadoop Definitve guide, i have set up my hadoop in pseudo distributed mode and everything was…
hadoop
asked
June 22, 2011 6:48 pm CDT
Anshu Basia
1
vote
0
answers
48
views
Implementing parallel-for in hadoop
I would like to implement a parallel-for in on hadoop. Basically parallel-for receives a sub-skeleton (it could be a function like map() ) and an…
java
hadoop
asked
June 22, 2011 4:44 pm CDT
user811188
0
votes
0
answers
58
views
COLLECT_SET() in Hive, keep duplicates?
Is there a way to keep the duplicates in a collected set in Hive, or simulate the sort of aggregate collection that Hive provides using some other…
java
hadoop
udf
hive
asked
June 22, 2011 2:23 pm CDT
Travis Powell
0
votes
0
answers
44
views
Where do I download all of the necessary classes to write Hadoop MapReduce jobs?
I've recently started working with Hadoop and have been learning how to write MapReduce jobs. All over the internet, I can find examples and…
api
class
download
hadoop
mapreduce
asked
June 22, 2011 12:26 pm CDT
Kurtis
0
votes
0
answers
42
views
How to get files of fixed size in map-reduce job output
I have a use case where I want to process data and generate output of fixed size , say 1 GB i.e. each map-reduce job output should be 1 Gb. Does…
hadoop
mapreduce
asked
June 22, 2011 11:45 am CDT
user656189
0
votes
2
answers
41
views
Uploading large gzipped data files to HDFS
I have a use case where I want to upload big gzipped text data files (~ 60 GB) on HDFS. My code below is taking about 2 hours to upload these files…
java
hadoop
hdfs
gzipstream
asked
June 22, 2011 11:23 am CDT
user656189
0
votes
3
answers
67
views
COLLECT_SET() in Hive (Hadoop)
I just learned about the collect_set() function in Hive, and I started a job on a development 3-node cluster. I only have about 10 GB to process.…
hadoop
mapreduce
udf
hive
asked
June 21, 2011 6:48 pm CDT
Travis Powell
0
votes
0
answers
47
views
ML/Data Mining/Big Data : Popular language for programming and community support
I am not sure if this question is correct, but I am asking to resolve the doubts I have. For Machine Learning/Data Mining , we need to learn about…
java
python
hadoop
machine-learning
bigdata
asked
June 21, 2011 12:54 pm CDT
daydreamer
0
votes
1
answers
82
views
Will using hadoop 20-append with hbase 90.3 break?
Trying to install hbase, but the word on the street is that if I don't use a hadoop from the 20-append branch, I'll lose data. This tutorial says…
hadoop
hbase
asked
June 20, 2011 6:00 pm CDT
nnythm
Pages
:
1
|
2
|
3
|
4
|
5
>
[28]
543
hadoop
Tagged:
hadoop
Related Tags
mapreduce
× 121
java
× 90
hbase
× 62
hdfs
× 49
hive
× 36
pig
× 33
python
× 19
piglatin
× 15
cloudera
× 13
mahout
× 12
amazon-ec2
× 12
cluster
× 12
mysql
× 11
map
× 10
amazon-web-services
× 10
amazon-emr
× 9
apache
× 9
distributed-computing
× 9
reduce
× 9
cassandra
× 8
lucene
× 8
nosql
× 8
hadoop-plugins
× 8
streaming
× 7
php
× 7
ubuntu
× 7
nutch
× 6
thrift
× 6
xml
× 6
English
Russian
Copyright 2007 - 2012
Best unofficial Apache Server developers community
Privacy policy