网站首页  词典首页

请输入您要查询的英文单词:

 

单词 hadoop
释义

hadoop

  • 网络分布式计算;分布式计算平台;分布式文件系统
1.
分布式计算
-有分布式计算(Hadoop)经验者优先 5、搜索级研发工程师 岗位描述 : -负责搜索相关业务的架构设计与开发 -负责搜索相关服务 …
www.pin5i.com
2.
分布式计算平台
b) 有大规模分布式计算平台Hadoop)的使用和并行算法开发及应用经验; c) 优秀的沟通表达能力。
www.cognoschina.net
3.
分布式文件系统
...多disk啊 太crazy了 是用来做数据仓库还是分布式文件系统hadoop)了 小可才疏学浅 目前见识过40个的 780跑DB的
www.aixchina.net
4.
大数据
...服人456:从目前接触的技术来看,无论是虚拟机还是大数据(hadoop)都是在基于二层网络架构进行数据传输,一但出现跨网 …
www.bellsent.com
5.
云计算和大数据
摘 要:英特尔CAS 2.0解决方案,在数据库/OLTP、虚拟化、云计算和大数据Hadoop)应用场景中可带来显著的I/O和应用性 …
www.d1net.com
6.
分布式并行处理框架
...的算法和相关理论(模式识别,人工智能) (2)熟知分布式并行处理框架Hadoop),并具有...
www.jobyun.com

例句

释义:
1.
If I'm a developer using Hadoop and want to look at a bit of data, it will let me run some reports against the file system.
如果我是个使用Hadoop的开发者,想要查看一些数据,那么就可以通过文件系统报表达成所愿。
www.infoq.com
2.
We're trying to follow the path taken by the Hadoop project concentrating on robustness, scaling, correctness, and community-building first.
我们将追随Hadoop项目所采取的路线,首先把精力集中在健壮性、扩展性、正确性以及社区建立上。
www.infoq.com
3.
As the hadoop-0. 20 is one of your primary interfaces to the Hadoop cluster, you'll see this utility used quite a bit through this article.
因为hadoop-0.20是Hadoop集群的主要接口之一,您会看到本文中多次使用这个实用程序。
www.ibm.com
4.
Now that I've coded my map and reduce implementations, all that's left to do is link everything up into a Hadoop Job.
现在我已经对我的map和reduce实现进行了编码,接下来所要做的是将所有这一切链接到一个HadoopJob。
www.ibm.com
5.
From this article, it's easy to see how Hadoop makes distributed computing simple for processing large datasets.
通过本文很容易看出Hadoop显著简化了处理大型数据集的分布式计算。
www.ibm.com
6.
All that's needed is a representation of the data in a vector form that the Hadoop infrastructure can use.
所有这一切的需要就是用矢量格式表达Hadoop基础设施可以使用的数据。
www.ibm.com
7.
Well, as you've probably guessed, Hadoop makes that easy to do.
当然,您已经猜到了,Hadoop可以轻松地做到。
www.ibm.com
8.
But from the previous discussion, it's easy to see how Hadoop provides parallel processing of work.
但是,通过前面的讨论很容易看出Hadoop如何提供并行处理。
www.ibm.com
9.
Not to be outdone, commercial Hadoop pioneer Cloudera announced an HDFS partnership of its own yesterday.
商业Hadoop的先驱Cloudera也不甘示弱,于昨天发布了自己的HDFS合作伙伴计划。
blog.sina.com.cn
10.
A key part of the announcement was that Yahoo would make available a Hadoop enabled super computing data center named M45.
该声明的关键是Yahoo将建立一个使用Hadoop的超级计算数据中心,名为M45。
www.infoq.com
1.
Alas, there are several things that Hadoop does not do, at least when accessed through the MapReduce interface.
唉,有几件事情Hadoop也不做,至少在通过MapReduce访问接口。
blog.sina.com.cn
2.
Now you have set up the Hadoop Cluster on the cloud, and it's ready to run the MapReduce applications.
现在,已经在云中设置了Hadoop集群,该运行MapReduce应用程序了。
www.ibm.com
3.
Since we are going to be connecting to the hadoop file system, we might as well test that as well.
因为我们要连接到hadoop文件系统,我们不妨测试。
blog.sina.com.cn
4.
One particularly handy aspect of Hadoop is that it handles the raw parsing of an input file, so that you can deal with one line at a time.
Hadoop可以对输入文件进行原始解析,这一点特别有用,这样您就可以每次处理一行。
www.ibm.com
5.
For all the other settings, keep the defaults or choose the same values as you did for the Hadoop Master node.
对于所有其他设置,保留其默认值或者选择与HadoopMaster节点相同的值。
www.ibm.com
6.
It is assumed that the Hadoop slave node has been configured a priori in such a manner that it registers with the Hadoop master node.
这里假设Hadoop从节点已经在之前配置完成,也就是它已经注册到Hadoop主节点中。
www.infoq.com
7.
Now that you have installed Hadoop and tested the basic interface to its file system, it's time to test Hadoop in a real application.
既然已经安装了Hadoop并测试了文件系统的基本接口,现在就该在真实的应用程序中测试Hadoop了。
www.ibm.com
8.
This article introduces you to the important configurable parameters of Hadoop and the method for analyzing and tuning performance.
本文介绍重要的Hadoop可配置参数以及分析和调优性能的方法。
www.ibm.com
9.
That magically seems to work, indicating that we can, indeed, connect to another machine and run the hadoop commands.
魔法般的似乎工作,表明我们可以,事实上,连接到另一台机器上,运行hadoop命令。
blog.sina.com.cn
10.
The Hadoop runtime will split up the data (log files) that needs to be processed and give each node in your cluster a chunk of data.
Hadoop运行时将分割需要处理的数据(一些日志文件)并向您的集群中的每个节点分配一个数据块。
www.ibm.com
1.
data format designed to support data-intensive applications, and provides support for this format in a variety of programming languages.
Avro[1]是最近加入到Apache的Hadoop家族的项目之一。为支持数据密集型应用,它定义了一种数据格式并在多种编程语言中支持这种格式。
www.infoq.com
2.
One irony of this code and the Hadoop framework is that the input files do not have to be in the same format.
一个讽刺,这段代码和Hadoop框架是输入文件不需要在相同的格式。
blog.sina.com.cn
3.
Hadoop is really designed to run in a distributed manner where it handles the coordination of various nodes running map and reduce.
Hadoop的设计旨在以一种分布式方式运行,处理运行map和reduce的各个节点之间的协调性。
www.ibm.com
4.
You can perform a couple of tests to ensure that Hadoop is up and running normally (at least the namenode).
可以通过几个检查确认Hadoop(至少是namenode)已经启动并正常运行。
www.ibm.com
5.
Thanks to the cloud and Hadoop, it is now possible to handle large amounts of structured or unstructured data in a timely manner.
由于云和Hadoop的出现,及时处理大量的结构化或非结构化数据目前已成为可能。
www.ibm.com
6.
So over the past 2 weekends, I've worked on a hobby project, which lets you turn your Hudson cluster into a Hadoop cluster.
所以在过去的两个周末里,我一直在从事一个业余爱好项目,这个项目可以把Hudson集群转化成Hadoop集群。
www.bing.com
7.
Run the clustering algorithm of choice using one of the many Hadoop-ready driver programs available in Mahout.
使用Mahout中可用的Hadoop就绪的驱动程序运行所选集群算法。
www.ibm.com
8.
The two core components are the Hadoop Distributed File System for storing data and Hadoop MapReduce for writing parallel-processing jobs.
其中两个核心组件是用于存储数据的HadoopDistributedFileSystem(Hadoop分布式文件系统)和用于写入并行处理任务的HadoopMapReduce。
blog.sina.com.cn
9.
The company employs many of the core Hadoop contributors and intends to provide support and training.
该公司雇佣了众多Hadoop项目的核心人员欲以提供相应的支持和培训。
www.infoq.com
10.
Open source software designed by IBM to help students develop programs for clusters running Hadoop.
IBM设计了开源软件去帮助学生们为运行Hadoop的集群开发程序。
www.infoq.com
1.
You could just use the raw output from Hadoop (a name and value on each line, separated by a space).
您可以只是使用来自Hadoop的原始输出(每行上有一个名称和值,用空格分隔)。
www.ibm.com
2.
As a distributed framework, Hadoop enables many applications that benefit from parallelization of data processing.
作为分布式框架,Hadoop让许多应用程序能够受益于并行数据处理。
www.ibm.com
3.
Standalone Mode: By default, Hadoop is configured to run in a non-distributed standalone mode.
单独模式:在默认情况下,Hadoop以非分布的单独模式运行。
www.ibm.com
4.
If not what is the plan in terms of moving it from an experimental technology to a core infrastructure component.
如果还没有,有什么计划让Hadoop从一个实验性的产品向核心基础组件迁移?
www.infoq.com
5.
developed Hadoop, permits AI systems to run data and algorithms across multiple servers simultaneously.
的结合,可以让AI系统在多个服务器上同时的运行数据和算法。
www.bing.com
6.
This flexibility can open new opportunities for Hadoop in a richer set of applications.
在更加丰富的应用程序集中此灵活性可以为Hadoop创造新的机会。
www.ibm.com
7.
feel this would be a big boost to both performance and utility, and it would leverage the power already provided by the Hadoop framework.
我觉得这将是一个巨大的鼓舞作用及表现的用途上,而它将影响作用的力量已经提供Hadoop框架。
blog.sina.com.cn
8.
Those log files can be huge, but the work will be split up among the machines (nodes) in your Hadoop cluster.
那些日志文件可能很大,但是挖掘工作将在您的Hadoop集群中的多个机器(节点)之间分配。
www.ibm.com
9.
Instead, Hadoop can be viewed as a way to distribute both data and algorithms to hosts for faster parallel processing.
相反地,Hadoop可以被视为一种可以同时将数据和算法分配到主机以获得更快速的并行处理速度的方法。
www.ibm.com
10.
The next article in this series will explore how to configure Hadoop in a multi-node cluster with additional examples. See you then!
本系列中的下一篇文章通过更多示例讨论如何在多节点集群中配置Hadoop。
www.ibm.com
1.
To achieve speed and scalability, Hadoop relies on MapReduce, a simple but powerful framework for parallel computation.
为了实现快速和可伸缩性,Hadoop依赖于MapReduce,一个简单但强大的并行计算框架。
www.ibm.com
2.
With that disclaimer in place, let's dive right into Hadoop installation and configuration.
现在,我们来讨论Hadoop的安装和配置。
www.ibm.com
3.
Hadoop is an open-source, flexible Java framework for large-scale data processing on commodity hardware networks.
Hadoop是一个灵活的开放源码Java框架,用于在一般硬件网络上执行大规模数据处理。
www.ibm.com
4.
Hadoop Tutorial introduces Hadoop and provides a detailed discussion of its use and configuration.
Hadoop教程介绍了Hadoop并提供其使用与配置的详细讨论。
www.ibm.com
5.
My ultimate intention is to use this code to easily characterize input and result files that I create in the process of writing Hadoop code.
我的最终目的是使用这个代码输入和结果容易描述文件建立在写作过程中Hadoop代码。
blog.sina.com.cn
6.
This article described step-by-step instructions for setting up a three-node Hadoop cluster in minutes on the IBM Cloud.
本文描述了数分钟内在IBMCloud上组建由三节点组成的Hadoop集群的详细步骤。
www.ibm.com
7.
For more information, see the Hadoop Web site and resources, as well as the IBM cloud computing resources (see Resources).
有关更多信息,请参见Hadoop网站及资源,以及IBM云计算资源(参见参考资料)。
www.ibm.com
8.
So they use daily batch processing with Hadoop as an important part of their calculations.
因此,他们将每天对Hadoop的批处理作为计算的重要组成部分。
www.infoq.com
9.
This means that OCCI-managed virtual machines with Hadoop's Task Trackers and Data Nodes must be added on-demand.
这意味着必须随需增加以OCCI管理的、附有Hadoop任务跟踪器和数据节点的虚拟机。
www.infoq.com
10.
That benefit applies to your architecture too - you'll find out sooner that EJB or Hadoop is or is not working out.
它所带来的益处同样适合于架构——能更早地发现EJB或Hadoop是否适用。
www.infoq.com
1.
Keep it in mind, though, that Hadoop is capable of handling much larger data sets.
记住,Hadoop有能力处理更大的数据集。
www.ibm.com
2.
In addition, the steps of the various flows are intelligently converted into map-reduce invocations against the hadoop cluster.
此外,各种流的步骤被智能地转换成对应于hadoopcluster的map-reduce调用。
www.infoq.com
3.
Hadoop can be used in many applications beyond simply computing word counts of large data sets.
Hadoop可用于许多应用程序上,其已超越了为大型数据集简单计算字数的工作。
www.ibm.com
4.
How about using Hadoop for storing old artifacts, so that you can utilize the combined storage of a cluster?
为什么不试试使用Hadoop来存储那些旧东西,这样一来,你就可以利用整个集群的联合存储了。
www.bing.com
5.
My map implementation in Listing 6 is simple: Hadoop basically invokes this class for each line of text it finds in an input file.
清单6中的map实现比较简单:本质上是,Hadoop为在输入文件中找到的每一行文本调用这个类。
www.ibm.com
6.
Hadoop scales out to myriad nodes and can handle all of the activity and coordination related to data sorting.
Hadoop可以扩展到无数个节点,可以处理所有活动和相关数据存储的协调。
www.ibm.com
7.
EMC's Hadoop strategy is actually quite unique, and its decision to embrace MapR is strong evidence of this.
EMC的Hadoop策略实际上非常独特。EMC采用MapR的存储有力地证明了这一点。
blog.sina.com.cn
8.
They say it is in their interest that the Hadoop project is not split and grows over time.
他们对Hadoop项目没有被分拆反而一直都在发展壮大表现出了兴趣。
www.infoq.com
9.
My purpose is to demonstrate the idea of using Hadoop to do normalization, rather than producing 100% working code.
我的目的是要证明这种利用Hadoop做正常化,而不是生产100%工作代码。
blog.sina.com.cn
10.
By the way, I feel that both of these are very, very reasonable requests, and the hadoop framework should support them.
顺便说一句,我觉得这两种非常,非常的合理要求,hadoop框架应当支持他们。
blog.sina.com.cn
1.
Run this code and you'll see a bunch of text fly across the screen as Hadoop begins doing its work.
运行这些代码,Hadoop开始运行时您将可以看到一堆文本在屏幕上一闪而过。
www.ibm.com
2.
An edge node is a machine that has the Hadoop libraries installed , yet is not part of the actual cluster.
edge节点是安装有Hadoop库的计算机,但不是真正簇集中的一部分。
www.infoq.com
3.
Apache Hadoop is a software framework (platform) that enables a distributed manipulation of vast amount of data.
ApacheHadoop是一个软件框架(平台),它可以分布式地操纵大量数据。
www.ibm.com
4.
Hive originated from within Facebook, and makes it possible to use SQL queries against Hadoop, making it easier for non-programmers to use.
Hive就是发源于Facebook,使得对于Hadoop使用的SQL查询成为可能,从而是其更容易对非程序员使用。
guyot.blog.163.com
5.
Some specify file paths on your system, but others adjust levers and knobs deep inside Hadoop's guts.
一些变量指定系统上的文件路径,而其他变量对Hadoop的内部进行深入的调整。
www.ibm.com
6.
For this image, all Hadoop components are automatically started as soon as the image is in active status.
对于此映像,只要映像变为活动状态,就会自动启动所有的Hadoop组件。
www.ibm.com
7.
He noted "we believe Hadoop is now ready for mainstream enterprise use. "
他指出“我们相信Hadoop已经为主流企业的应用做好了准备”。
www.infoq.com
8.
This process requires understanding Java generics, because Hadoop prefers explicit type safety.
这个过程需要理解Java泛型,因为Hadoop选择使用显式类型,为了安全起见。
www.ibm.com
9.
In the Resources section, you can find more details on Hadoop architecture, components, and theory of operation.
在参考资料中,可以找到关于Hadoop架构、组件和操作理论的更多信息。
www.ibm.com
10.
HBase is a database representation over Hadoop's HDFS, permitting MapReduce to operate on database tables over simple files.
HBase是数据库在Hadoop的HDFS上的表现,在简单文件上执行MapReduce以操作数据库表。
www.ibm.com
1.
How to verify your cluster is working by stopping and starting all Hadoop components, testing a few commands, and reviewing the web console.
如何通过停止与启动所有Hadoop组件、测试一些命令以及检查Web控制台来验证集群。
www.ibm.com
2.
Hopefully, this knowledge can help you make full use of your Hadoop cluster and result in finishing your jobs more efficiently.
希望这些知识能够帮助您充分使用Hadoop集群,更高效地完成作业。
www.ibm.com
3.
We get to take advantage of a huge amount of work that's already been done in Hadoop - many bits of HBase are reused from Hadoop.
我们利用了Hadoop中已经完成的大量工作——HBase的许多代码是重用Hadoop的代码。
www.infoq.com
4.
Although Hadoop is not well suited to every scenario, it provides clear performance benefits.
尽管Hadoop并不是对每个场景都适合,但是它提供了良好的性能效益。
www.ibm.com
5.
Hudson could also install Hadoop binaries on all the nodes as necessary, really making this solution a turn-key.
只要有必要,Hudson也可以在所有结点安装Hadoop二进制码,而这就是解决方案的关键点。
www.bing.com
6.
To set up Hadoop in standalone or pseudo-distributed mode, refer to the Hadoop Web site for reference.
要想以单独或伪分布模式设置Hadoop,请参考Hadoop的网站。
www.ibm.com
7.
The first two articles of this series focused on the installation and configuration of Hadoop for single- and multinode clusters.
此系列的前两篇文章专注于单节点和多节点集群的Hadoop安装及配置。
www.ibm.com
8.
Apache Hadoop Core is a software platform that lets one easily write and run applications that process vast amounts of data.
Hadoop为编写和运行处理大量数据的应用提供一个软件平台。
www.bing.com
9.
Interesting visualizations are fairly common, but they typically are not pluggable into the existing tooling around Hadoop.
有趣的可视化比较常见,但是它们通常不能插入现有的Hadoop相关工具。
www.ibm.com
10.
Pseudo-distributed Mode: Hadoop can also be run in a single node pseudo-distributed mode.
伪分布模式:Hadoop还可以以单节点的伪分布模式运行。
www.ibm.com
1.
Fully-distributed Mode: Hadoop is configured on different hosts and run as a cluster.
全分布模式:Hadoop配置在不同的主机上,作为集群运行。
www.ibm.com
2.
It provides a query language over Hadoop data while supporting the traditional Hadoop programming model.
其在Hadoop数据上提供查询语言,此语言支持传统的Hadoop编程模型。
www.ibm.com
3.
This utility is how you interact with the Hadoop cluster, from inspecting the file system to running jobs in the cluster.
这个实用程序用于与Hadoop集群交互,包括检查文件系统、在集群中运行作业等等。
www.ibm.com
4.
Starting every java node as a mapper task incurs an overhead of starting a new JVM in Hadoop cluster.
它把每个java节点都作为mapper任务启动,这会导致在Hadoop簇集中启动新的JVM而产生额外的开销。
www.infoq.com
5.
Every time I introduce a new data structure used by the Hadoop framework, I have to define two classes.
每次我介绍一个新的数据结构Hadoop使用框架,我必须定义两个阶级。
blog.sina.com.cn
6.
We definitely recommend Cascading to anyone doing serious data processing and mining with Hadoop.
我们向那些正在用Hadoop进行大量数据处理和挖掘的人强烈推荐Casading。
www.infoq.com
7.
Knowing that all of your processes are available, you can use the hadoop command to inspect the local namespace (see Listing 2).
确认所有进程都在运行之后,可以使用hadoop命令检查本地名称空间(见清单2)。
www.ibm.com
8.
The Hadoop Master node instance must be provisioned first.
必须首先提供HadoopMaster节点实例。
www.ibm.com
9.
The Hadoop ecosystem demonstrates the rich community behind the project.
Hadoop生态系统演示了项目背后丰富的社区活动。
www.ibm.com
10.
The Hadoop Summit of 2010 included presentations from a number of large scale users of Hadoop and related technologies.
Hadoop峰会2010上,一系列Hadoop及其相关技术的大规模用户带来了演讲报告。
www.infoq.com
1.
This time you are extending the org. apache. hadoop. mapreduce. Reducer class and implementing its reduce method.
这一次,您将扩展org.apache.hadoop.mapreduce.Reducer类并实现其reduce方法。
www.ibm.com
2.
Hbase is built on top of Hadoop and is designed for low-latency and data mutability.
HBase是建立在Hadoop的之上的,并且具有低延时和数据可变性的设计。
blog.sina.com.cn
3.
If you mess up something, you can format the HDFS and clear the temp directory specified in hadoop-site. xml and start again.
如果弄乱了什么东西,可以格式化HDFS并清空hadoop-site.xml中指定的临时目录,然后重新启动。
www.ibm.com
4.
Note in this example that the -file options simply tell Hadoop to package your Ruby scripts as part of the job submission.
在此示例中请注意-file选项会简单地告诉Hadoop来打包您的Ruby脚本作为部分作业提交。
www.ibm.com
5.
This installation is ideal for playing with Hadoop and learning about its elements and interfaces.
这个配置非常适合体验Hadoop以及了解它的元素和界面。
www.ibm.com
6.
Recall that at the top of the Hadoop cluster is the namenode, which manages the HDFS.
位于Hadoop集群最上层的是namenode,它管理HDFS。
www.ibm.com
7.
He highlighted that Yahoo uses Hadoop to analyze every page click and optimizes rankings for content, updating the results every 7 minutes.
他强调Yahoo使用Hadoop来分析每一个页面点击并优化内容的排名,每7分钟更新一次结果。
www.infoq.com
8.
It is built using Apache Hadoop for hourly index updates and Apache HBase to provide random access to item information.
它使用ApacheHadoop来支持每小时进行的索引更新,使用ApacheHBase对随机存取信息提供支持。
www.infoq.com
9.
When working with Hadoop, accessing data in different data centers is the worst case scenario.
使用Hadoop时,访问位于不同数据中心内的数据是最糟糕的情况。
www.ibm.com
10.
Mr Sze used Yahoo's Hadoop cloud computing technology to more than double the previous record.
苏先生使用Yahoo的Hadoop云计算技术将以前的PI值计算的记录位数整个翻了一番。
www.bing.com
1.
Finally, Pig is a platform for large data set analysis that includes a high-level language for Hadoop programming.
最后,Pig是一个包括适用于Hadoop编程的高级语言的大型数据库集分析的平台。
www.ibm.com
2.
Hadoop can run analytics and hit FlockDB in parallel to assemble social graph aggregates.
Hadoop可以运行分析并找到在FlockBD中相似的社交图数据集合。
blog.sina.com.cn
3.
This article has shown you a simple example of crunching big data with Hadoop.
本文展示了使用Hadoop挖掘大数据的一个简单示例。
www.ibm.com
4.
This final article in the Hadoop series explored the development of a map and reduce application in Ruby for the Hadoop framework.
这是Hadoop系列的最后一篇文章,探索了在适用于Hadoop框架的Ruby中开发map和reduce应用程序。
www.ibm.com
5.
If you already know Java, you can probably start working through online tutorials or books on Hadoop.
如果您已经熟悉Java,您可以开始学习Hadoop上的在线教程或图书。
www.ibm.com
6.
An ecosystem has emerged to provide much-needed tooling and support around Hadoop.
一个生态系统的出现提供了围绕Hadoop的工具和支持。
www.ibm.com
7.
Hadoop is well-suited for processing huge files containing structured data.
Hadoop非常适合处理包含结构数据的大型文件。
www.ibm.com
8.
The namenode is the master server in Hadoop and manages the file system namespace and access to the files stored in the cluster.
namenode是Hadoop中的主服务器,它管理文件系统名称空间和对集群中存储的文件的访问。
www.ibm.com
9.
The massive data volumes for such tests is an example of how Facebook relies on Hadoop for data crunching.
这一类测试所产生的巨大的数据集正是Facebook使用Hadoop来处理数据的例子。
www.infoq.com
10.
You now know how to monitor a Hadoop cluster, analyze the system bottleneck using the monitoring data, and tune the performance.
您现在了解了如何监视Hadoop集群、使用监视数据分析系统瓶颈和优化性能。
www.ibm.com
1.
The biggest boost comes from having ready access to the Hadoop core developers.
最大的推动力是我们可以借用Hadoop的核心开发者。
www.infoq.com
2.
We've also been exposed to input and review from the Hadoop community at large, which is an enormous benefit.
我们也被公布于Hadoop社区,从中获取反馈,这对我们来说是好处是巨大的。
www.infoq.com
3.
Check out this list of Mapreduce and Hadoop algorithms in academic papers.
检查学术文章中的Mapreduce和Hadoop算法的列表。
www.ibm.com
4.
If you only provision a Hadoop Master node, and therefore only work on that single node, you are working in pseudo-distributed mode.
如果您只提供一个HadoopMaster节点,那么只能在一个节点上工作,即在伪分布模式下工作。
www.ibm.com
5.
I assume that this is a dedicated Hadoop box, as this step does have some security implications (see Listing 1).
我假设这是专用的Hadoop机器,因为这个步骤对安全性有影响(见清单1)。
www.ibm.com
6.
Or how about extension of JUnit for distributing tests across a Hadoop cluster?
或者试试通过Hadoop集群使用JUnit的扩展来进行分布式测试。
www.bing.com
7.
Enter the $HADOOP_HOME directory and modify configuration files on NN.
在NN上,进入$HADOOP_HOME目录并修改配置文件。
www.ibm.com
8.
HBase is a key value store akin to BigTable which stores data in Hadoop's DFS file system.
HBase是类似BigTable的键值存储模型,将数据存储于Hadoop的DFS文件系统。
www.infoq.com
9.
Azkaban is an open source workflow system for Hadoop, providing cron-like scheduling and make-like dependency analysis, including restart.
Azkaban是个面向Hadoop的开源工作流系统,提供了类似于cron的调度,类似于make的依赖分析,还包含了重启。
www.infoq.com
10.
Over the past three months, I have been teaching myself enough Hadoop to get comfortable with using the environment for analytic purposes.
在过去的三个月,我一直教我自己足够的Hadoop轻松使用的环境分析的目的。
blog.sina.com.cn
1.
You've seen how to inspect the HDFS, but if you're looking for information about Hadoop's operation, you'll find the Web interfaces useful.
您已经知道如何检查HDFS了,但是如果要寻找Hadoop的操作的相关信息,会发现Web界面很有用。
www.ibm.com
2.
Hadoop's Name Node and Job tracker reside on the master node and might not need scaling at this stage.
Hadoop的命名节点和工作跟踪器贮存在主节点上,在这个阶段可能并不需要扩充。
www.infoq.com
3.
Run the following command to format the Hadoop distributed file system to initialize.
运行以下命令对HDFS分布式文件系统进行格式化。
www.ibm.com
4.
The streaming utility within Hadoop implements a type of data flow glue.
Hadoop内的流实用工具实现了一种数据流胶的类型。
www.ibm.com
5.
The other two classes are needed for the Hadoop framework.
其他两类Hadoop所需的框架。
blog.sina.com.cn
6.
Hadoop provides some helper tools to simplify its startup.
Hadoop提供一些简化启动的辅助工具。
www.ibm.com
7.
From the IBM Cloud Control panel tab, click the Hadoop master instance.
在IBMCloudControl面板的选项卡中,单击Hadoopmaster实例。
www.ibm.com
8.
Hive is a data warehouse infrastructure built on top of Hadoop.
配置单元是一个建立在Hadoop顶部的数据舱库基础结构。
www.ibm.com
9.
hadoop-env. sh, masters, and slaves to each slave nodes; you can use SCP or another copy utility.
把hadoop-site.xml、hadoop-env.sh、masters和slaves复制到每个从节点;可以使用SCP或其他复制工具。
www.ibm.com
10.
Figure 15 shows a summary of what has been configured for the Hadoop Master node.
图15显示了HadoopMaster节点的配置内容的摘要。
www.ibm.com
1.
When there is a failure, Hadoop tries again, insulating the user from sporadic hardware faults.
当有一个失败,Hadoop继续探索、绝缘用户从零星的硬件故障。
blog.sina.com.cn
2.
Finally, explore the output using the cat file system operation through the hadoop utility (see Listing 11).
最后,通过hadoop实用工具使用cat文件系统操作来探索输出(请参考清单11)。
www.ibm.com
3.
Finally, Pig is a platform on Hadoop for analyzing large data sets.
最后,Pig是Hadoop中用于分析大型数据集的平台。
www.ibm.com
4.
Repeat the exact same process to create another Hadoop Data node; this time, name it Hadoop slave 2.
重复完全相同的过程即可创建另一个HadoopData节点;这次将它命名为Hadoopslave2。
www.ibm.com
5.
In the example, you used Hadoop to process Apache web server access logs.
在示例中,您使用Hadoop处理Apacheweb服务器访问日志。
www.ibm.com
6.
They have also released Oozie, a workflow engine for Hadoop, which has become the de-facto ETL standard at Yahoo.
他们还发布了Oozie,一个Hadoop的工作流引擎,这已在雅虎成为事实上的ETL标准。
www.infoq.com
7.
Stop all components of Hadoop with the stop-all. sh command.
使用stop-all.sh命令停止Hadoop的所有组件。
www.ibm.com
8.
Compile it and archive it in a Jar file, which will be run with the hadoop command later.
对它执行编译并存档在一个Jar文件中,后面hadoop命令将运行这个文件。
www.ibm.com
9.
Is it possible to marry the new favorite data-as-a-platform, Hadoop, with SOA, which is somewhat falling out of favor?
新兴且流行的数据即平台(Data-as-a-platform)Hadoop与一定程度上失宠的SOA的联姻,可能么?
www.infoq.com
10.
Many businesses turn to open source tools, such as Apache's Hadoop, when working with big data.
许多机构转向开源工具,比如Apache的Hadoop来处理大数据。
blog.163.com
1.
That will end looking looking something like hadoop, mogilefs or S3 - a data parallel architecture.
那样最终就是像Hadoop、Mogilefs或S3一样的东西——并行数据架构。
www.infoq.com
2.
In the first part of this series, you saw how to crunch big data using Apache Hadoop.
在本系列的第1部分中,您已看到了如何使用ApacheHadoop处理大型数据。
www.ibm.com
3.
In this case you want to provision two Data nodes to build a three-node Hadoop cluster.
在这个示例中,您需要预备两个Data节点来创建由三节点组成的Hadoop集群。
www.ibm.com
4.
Listing 3 illustrates how to use the streaming utility within Hadoop, while Figure 3 shows graphically how the flow is defined.
清单3说明如何在Hadoop内使用流实用工具,图3图形化地显示了如何定义流。
www.ibm.com
5.
Launch the start-all. sh script to start the Hadoop daemons.
启动start-all.sh脚本以启动Hadoop守护进程。
www.ibm.com
6.
Unpack the downloaded Hadoop distribution on each node; $HADOOP_HOME is used below to represent the unpack position.
在每个节点上解压下载的Hadoop发行版;下面使用$HADOOP_HOME代表解压位置。
www.ibm.com
7.
Hadoop also keeps track of which processors and disk systems are working.
也Hadoop跟踪处理器和磁盘系统的工作。
blog.sina.com.cn
8.
A Job JAR packages up all of the code and dependencies into a single JAR file for easy loading into Hadoop.
JobJAR包可以将所有代码和依赖关系打包到一个JAR文件中,以便于加载到Hadoop中。
www.ibm.com
9.
Notice that you're using a command called hadoop-0. 20 to inspect the file system.
注意,使用hadoop-0.20命令检查文件系统。
www.ibm.com
10.
Really I just want to know the numbers for each given day in any seven-day span, and I can get that information easily with Hadoop.
事实上,我只想知道在七天这样一个时间段内任何一天的地震次数,使用Hadoop我就可以很容易的获取这一信息。
www.ibm.com
1.
Instead, I think that Hadoop should be compared to a parallel dataflow style of programming.
相反,我认为Hadoop要比平行数据编程风格的。
blog.sina.com.cn
2.
You can use the cat command (after finding the particular output file) through the hadoop-0. 20 utility to emit this data (see Listing 7).
找到输出文件之后,可以通过hadoop-0.20实用程序使用cat命令查看数据(见清单7)。
www.ibm.com
3.
Because I'm running on Ubuntu (the Intrepid release), I use the apt utility to grab the Hadoop distribution.
因为我运行Ubuntu(Intrepid版),所以使用apt实用程序获取Hadoop发行版。
www.ibm.com
4.
Loading the data for Hadoop is much simpler .
Hadoop加载数据是非常简单的。
blog.sina.com.cn
5.
In this article, we will only cover setting up Hadoop in fully-distributed mode.
在本文中,我们只讨论以全分布模式设置Hadoop。
www.ibm.com
6.
I don't actually feel qualified to comment on many of the operational aspects of optimizing Hadoop code.
我真的不觉得有资格评论很多操作方面的优化Hadoop代码。
blog.sina.com.cn
7.
You'll find one namenode and one secondary namenode in a Hadoop cluster.
在每个Hadoop集群中可以找到一个namenode和一个secondarynamenode。
www.ibm.com
8.
Besides Hadoop parameters, there are also some system parameters, such as inter-rack bandwidth, which affects overall performance.
除了Hadoop参数之外,还有一些会影响总体性能的系统参数,比如机架间带宽。
www.ibm.com
9.
This is data that can be generated by the Hadoop job developed in Part 1 of this series.
这是由本系列第1部分中所开发的Hadoop作业生成的数据。
www.ibm.com
10.
Currently, there is a lack of skill in knowing how to configure and manage cloud and Hadoop technologies.
目前还缺乏关于了解如何配置并管理云和Hadoop技术的技巧。
www.ibm.com
1.
For now, we will concentrate on crunching data with Hadoop.
但现在,我们将主要关注使用Hadoop挖掘数据。
www.ibm.com
2.
The combination of cloud computing and open source programs such as the Yahoo!
云计算和开源程序(比如由Hadoop开发的YAHOO!)
www.bing.com
3.
If you provision one or more Hadoop Data nodes in addition to the Hadoop Master node, you are working in fully distributed mode.
如果除了HadoopMaster节点之外,您还提供了一个或多个HadoopData节点,那么您是在完全分布模式下工作。
www.ibm.com
4.
This is in the process of being contributed to the Hadoop open source project.
这还是一个正处于建设过程中的开源项目。
www.infoq.com
5.
Hadoop. I can include the IP database in my application jar file, which is pretty cool.
我可以包括IP数据库在我的申请jar文件,这很酷。
blog.sina.com.cn
6.
My wish, however, is that Hadoop were the parallel dataflow project.
我的愿望,不过,就是Hadoop是平行数据项目。
blog.sina.com.cn
7.
In addition to the Hadoop, a lot of Java implementation of MPI can also help to parallel single thread of tasks to scale to many nodes.
除了Hadoop,很多MPI的Java实现也可以用来将单线程的任务水平的扩展到多个节点上并行运行。
blog.163.com
8.
Before we dive into Apache Hadoop, we will give a brief introduction to the structure of the cloud computing system.
在讨论ApacheHadoop之前,我们先简要介绍一下云计算系统的结构。
www.ibm.com
9.
I guess the worst part about working with Hadoop is that the project has been going on for years without a stable 1. 0 release.
我觉得Hadoop最坏的方面是这个项目已经存在了几年都还没有一个稳定的1.0版。
www.infoq.com
10.
This is necessary, so the Hadoop Data node can be automatically added to the cluster.
为了能够将HadoopData节点自动添加到集群,该地址是必不可少的。
www.ibm.com
1.
We use Cloudera to deploy Hadoop clusters on EC2 of between 10 and 100 nodes for our data processing and analytical work.
我们使用Cloudera将Hadoop集群部署在EC2上,用于进行数据处理和分析工作,结点数介于10到100之间。
www.infoq.com
2.
With Hadoop, we can linearly scale clusters running on commodity hardware to incorporate larger and richer datasets.
借助Hadoop,我们可以线性扩展运行在商品硬件上的集群来集成更大更丰富的数据集。
www.ibm.com
3.
Notes: The parameter names listed above are all in the Hadoop 0. 20.
注意:上面列出的参数名都是Hadoop0.20.x中的;
www.ibm.com
4.
Note: If you use Hadoop release 0. 21. 0, this property name should be mapreduce. jobtracker. address.
注意:如果使用Hadoop0.21.0,这个属性名应该是mapreduce.
www.ibm.com
5.
For example, Twitter sends logging messages to Hadoop and writes the data directly into HDFS, the Hadoop Distributed File System.
比如Twitter发送登陆信息到Hadoop,并直接写入HDFS,Hadoop文件系统。
blog.163.com
6.
reported using Hadoop to sort one petabyte of data in about 16 hours (see Resources to learn more about these benchmarks).
据称雅虎使用Hadoop在16个小时内对1PB(petabyte)数据进行排序(参见参考资料更多地了解这些基准测试)。
www.ibm.com
7.
These are all important functionality for using the Hadoop framework.
这些都是重要的,专为使用Hadoop框架。
blog.sina.com.cn
8.
In this example, the Hadoop Master node IP address is 170. 224. 193. 137.
在这个示例中,HadoopMaster节点的IP地址是170.224.
www.ibm.com
9.
Its looking like we will get this feature in Hadoop 0. 18. 0.
看起来我们很可能在Hadoop0.18.0中获得这一特性。
www.infoq.com
10.
You can also extract the file from HDFS using the hadoop-0. 20 utility (see Listing 8).
还可以使用hadoop-0.20实用程序从HDFS中提取文件(见清单8)。
www.ibm.com
1.
Originally, I wrote the code using Hadoop 0. 18, because I was using the Yahoo virtual machine.
原来,我写的代码使用Hadoop0.18,因为我正在使用雅虎的虚拟机。
blog.sina.com.cn
2.
Note: If you choose to use Hadoop release 0. 21. 0, then you must use the current JDK, which is tracked by JIRA HADOOP-6941.
注意:如果选用Hadoop0.21.0,那么必须使用当前的JDK(由JIRAHADOOP-6941跟踪)。
www.ibm.com
3.
releasing federated storage across HDFS instances, in the next major Hadoop release
在Hadoop的下一个主要版本将会发布跨HDFS实例的联合存储
www.infoq.com
4.
And then install Hadoop for a pseudo-distributed configuration (all of the Hadoop daemons run on a single host)
然后,安装采用伪分布式配置的Hadoop(所有Hadoop守护进程在同一个主机上运行)
www.ibm.com
5.
Much of what Hadoop does goes unheralded by the typical MapReduce user. Hadoop
做太多的去的MapReduce总是典型用户。
blog.sina.com.cn
6.
Batch processing (time-insensitive stuff like genetics DB analysis, workflow-like workloads, Hadoop-like workloads, etc. )
批处理(时间敏感的东西,比如geneticsDB分析、类似工作流的工作负载、类似Hadoop的工作负载等)
www.ibm.com
7.
Every week they recompute their machine learning models for categories in a science Hadoop cluster
每个星期,他们在Hadoop科研集群上重新计算他们关于类别的机器学习模式
www.infoq.com
8.
So the proposed solution is to start leverage Hadoop as a cross-application data store
因此,他推荐的解决方案是使用Hadoop作为跨平台数据存储
www.infoq.com
9.
Hadoop also does a pretty good job of shuffling data around, between the map and reduce operations. Hadoop
也不错的周围把数据之间、地图和减少操作。
blog.sina.com.cn
10.
Their focus is now on developing Hadoop's distributed file system, HDFS
现在他们的重点是开发Hadoop的分布式文件系统,HDFS
www.infoq.com
1.
HBase, a column-oriented data storage environment designed to support large, sparsely populated tables in Hadoop
用于在Hadoop中支持大型稀疏表的列存储数据环境。
www.infoq.com
2.
Hive, a data warehouse infrastructure designed to support batch queries and analysis of files managed by Hadoop
用于支持Hadoop文件的批量查询和分析的数据仓库基础设施。
www.infoq.com
3.
one server in the cluster [that] should be dedicated to the following Hadoop components
集群里应该提供一个服务器给如下Hadoop组件专用
www.infoq.com
4.
The hadoop framework does not allow us to do some rather simple things. hadoop
框架的不允许我们做一些相对简单的事情。
blog.sina.com.cn
5.
Our experiment with Hadoop MapReduce and load balancing lead to two inescapable conclusions
我们的HadoopMapReduce和负载平衡的实验可以得到两个必然结论
www.ibm.com
6.
Hadoop can be configured so you work in one of three different modes
通过配置Hadoop,您可以在以下三种模式中的一种模式下工作
www.ibm.com
7.
Pig, a high-level programming language and runtime environment for Hadoop
针对Hadoop的高级编程语言及运行时环境。
www.infoq.com
8.
Step 4: Verifying your Hadoop cluster is working correctly
步骤4:确认您的Hadoop集群能够正常工作
www.ibm.com
9.
Like Cloudera's solution, IBM's BigInsights includes beside Hadoop a number of open source programs, such as
与Cloudera的解决方案类似,IBM的BigInsights包含了Hadoop以外的很多开源项目,例如
www.infoq.com
10.
Their focus in the last year was to improve Hadoop map-reduce
他们在去年的重点是改善Hadoop的map-reduce,这包括
www.infoq.com
1.
Every 5 minutes they use a production Hadoop cluster to rerank content based on recent data, updating results every 7 minutes
每隔5分钟,他们使用生产环境中的Hadoop集群基于最新数据重新排列内容,并每7分钟更新结果
www.infoq.com
2.
Dziuba gives another example. Instead of using Hadoop to process that data once you have it, you can use:
Dziuba提供了另外一个例子。与使用Hadoop来处理你所获得的信息不同,你可以使用:
www.bing.com
随便看

 

英汉双解词典包含2704715条英汉词条,基本涵盖了全部常用单词的翻译及用法,是英语学习的有利工具。

 

Copyright © 2004-2022 Newdu.com All Rights Reserved
更新时间:2025/8/19 17:53:03