New Year Special Limited Time Flat 70% Discount offer - Ends in 0d 00h 00m 00s - Coupon code: 70spcl

Huawei H13-723_V2.0 HCIP-Big Data Developer V2.0 Exam Practice Test

Page: 1 / 53
Total 526 questions

HCIP-Big Data Developer V2.0 Questions and Answers

Question 1

existFusionInsight HD where can I viewMapReduceThe result of running the application?

Options:

A.

FusionInsight Managerofwebinterface

B.

Loader Webinterface

C.

Solr webinterface

D.

Yarn Webinterface

Question 2

SparkThe calculation logic of the application will be parsed orDNG, this parsing operation is completed by which of the following function modules?

Options:

A.

Client

B.

Application Master

C.

Executor

D.

Driver

Question 3

existSparkIn application development, which of the following codes can correctly count words?

Options:

A.

val counts=textFile.map(line=>line.split(" ')).map(-rd=>(word,1)).reduceByKey(_+_)

B.

val counts=textFile.flatMap(line=>line.split(" ")).map(word=>(word,1)).reduceByKey(_+_)

C.

val counts=textFile.map(line=>line.split(" ")).map(word=>(word,1)).groupByKey()

D.

val counts=textFile.flatmap(line=>line.split(" ")).map(-rd=>(word,1)).reduceByKey()

Question 4

FusionInsight HDin, aboutOoze, which of the following operations can be performed on a suspended job? (many

select)

Options:

A.

kill

B.

run

C.

resume

D.

submit

Question 5

existStreamingin application development,BoltUse which of the following interfaces to sendTuple?

Options:

A.

to emit

B.

execute

C.

open

D.

nextTuple

Question 6

HDFSclient withNcopy toHDFSWhen writing a file, if one of the replicas fails to write, all replicas will return to writing

Entry failed.

Options:

A.

True

B.

False

Question 7

When the cluster is normal,RedisClient initiates oncegetCall, the client has () times of message interaction with the server?

Options:

A.

1

B.

2

C.

3

D.

4

Question 8

rightHBase, which of the following properties can be set? (multiple choice)

Options:

A.

Versions

B.

TTL

C.

Coprocessor

D.

Compression

Question 9

Which of the following options isYarncore functionality?

Options:

A.

storage

B.

Resource management

C.

search

D.

data transmission

Question 10

Oozieofworkflow.xmlfile is the workflow configuration file

Options:

A.

True

B.

False

Question 11

FusionInsight HDWhat are the supported ways to submitOozieOperation? (multiple choice)

Options:

A.

ShellOrder

B.

Java API

C.

Hue UI

D.

Rest API

Question 12

HDFSIn application development of , which of the following fields areHDFSInterfaces supported by the service? (multiple choice)

Options:

A.

BufferedOutputStream.write

B.

BufferedOutputStream.flush

C.

FileSystem.create

D.

FileSystem.append

Question 13

aboutKafkaThe characteristics of the following description are correct? (multiple choice)

Options:

A.

KafkaIt is a high-throughput, distributed, publish-subscribe-based messaging system

B.

KafkaPersistence of messages

C.

KafkaApplicable to offline and online message consumption scenarios

D.

Kafkaguarantee eachPartitionmessages in order

Question 14

FusionInsight HDmiddle,StreamingPackaging tools are used to package business codejarpackages and other dependenciesjarpackage, etc.

a completeStreamingAppliedjarBag

Options:

A.

True

B.

False

Question 15

Which of the following is notMapReducespecialty?

Options:

A.

easy to program

B.

good scalability

C.

real-time computing

D.

High fault tolerance

Question 16

FusionInsight HDin the system,hiveWhat are the supported interface types?

Options:

A.

JDB, Cinterface

B.

ODB, Cinterface

C.

Pythoninterface

D.

Rubyinterface

Question 17

FusionInsight HDin, useStreamingofLinuxWhen submitting a topology in command line mode, you need to first use a

StreamingUser with submit permissionkinitway authentication

Options:

A.

True

B.

False

Question 18

FusionInsight HD V100R002C60in, aboutHiveofPythonInterface type, which of the following descriptions is incorrect?

Options:

A.

RequirePythonversion no lower than2.6.6,not higher than2.7.0

B.

Client must havesetuptools, and the version is not lower than5.0

C.

hive_python_clientscript provides direct executionSQLfunctions, such ashive_python_client'show tables'

D.

PythonSecondary development does not support secure clusters

Question 19

When a task is submitted toYARNIn the cluster, which of the following options will generate for this taskTindivual JOPO

Options:

A.

ResourceManager

B.

container

C.

AppMaster

D.

NodeManager

Question 20

FlumewriteHDFSWhen the file is generated, what are the ways of generating the file? (multiple choice)

Options:

A.

Generate files directly according to the source file name

B.

Set the file generation time interval, close the old file when the time interval is reached, and the new data will be written to the new file

C.

Set a single file size, when the file size reaches the set value, the old file will be closed and the new data will be written import new file

D.

Set up single file storageeventquantity, when the file is storedeventWhen the number reaches the set value, close the old text file, new data will be written to a new file

Question 21

There are the following scenarios: new data is generated by the online system every day500G, you need to make statistics on these data by day, week, month and other dimensions summary.

ask if it is suitable for useHiveWhat kind of table to handle?

Options:

A.

Bucket table, bucket according to day

B.

Partition table, partitioned by day

C.

skew table, skew field select date, skew value random

D.

partition+Bucket table, partitioned by time, each partition2560buckets

Question 22

FusionInsight HDin the system,HiveWhat are the supported interface types? (multiple choice)

Options:

A.

JDB, Cinterface

B.

ODB, Cinterface

C.

Pythoninterface

D.

Rubyinterface

Question 23

A large-scale production enterprise plans to transform its internal logistics data and sales data into big data. The design department provides data storage

Analysis of the program, which of the following statements are correct?

Options:

A.

HBaseIt can store massive data and support dynamic expansion, which can fulfill the storage requirements of logistics and sales data.

B.

HbaseSupport colleges and universities to read immediately, and can complete real-time analysis and command of the logistics situation through reasonable design.

C.

Logistics data is very sparse,HBaseCan handle sparse data efficiently.

D.

HBaseIt can be built on ordinary commercial servers, and the construction cost is relatively low.

Question 24

FusionInsight HDin, aboutOoze, which of the following operations can be performed on a suspended job? (many

select)

Options:

A.

kill

B.

run

C.

resume

D.

submit

Question 25

When a task is submitted toYARNIn the cluster, which of the following options will generate ajob id?

Options:

A.

ResourceManager

B.

Contaioner

C.

AppMaster

D.

NodeManager

Question 26

FusionInsight HDin, aboutHiveandSparksql, which of the following descriptions are correct? (multiple choice)

Options:

A.

Both are available externallySQLor classSQLability

B.

Both can run onMapReduceon the computational framework

C.

Both are available via table fields andHBaseAccess by field mappingHBasedata on

D.

Both can run onSparkon the computational framework

Question 27

aboutStreamingthe topology (Topology), which of the following descriptions is wrong?

Options:

A.

OneTopologyis made up of a group ofSpoutcomponents andBoltcomponents passStream GroupingsA connected directed acyclic graph (DAG)

B.

Topologywill run until it is explicitlykill

C.

Business logic is encapsulatedTopologymiddle

D.

OneTopologyOnly one can be specified to startWorkerprocess

Question 28

FusionInsight HDin, yesSolrThe creation of various resources and the use of read and write permissions, which of the following statements is wrong?

Options:

A.

SolrUser group users can create configuration sets and createCollectionwhen specifying the configuration set

B.

SolrUsers of the user group can createCollection

C.

Collectioncreator, theCollectionroles with read and write permissions andSolr adminrole can beCollectionEnter

row update

D.

onlyCollectioncreator andSolr adminRoles can be deletedCollection

Question 29

existFusionInsight HDcluster, aboutkinitOperation command, which of the following statements is wrong? (multiple choice)

Options:

A.

Only the HMI account can be used.

B.

Only the machine account can be used.

C.

A client does not support the simultaneous use of multiple accounts.

D.

The ticket obtained by executing this command is intwenty fourIt will time out when it is small and needs to be executed againkinitcommand to log in again.

Question 30

for running onMapReduceThe application on the platform that this application depends onjarWhere will the bag be placed?

Options:

A.

HIVE

B.

HBASE

C.

HDFS

D.

DB

Question 31

FusionInsight HD in which ways can you viewOozieDebug results of the job? (multiple choice)

Options:

A.

CheckOozieofJava APIWhether the returned result is as expected

B.

pass throughHueofworkflowDashboard to view program running results

C.

pass throughHue's file browser to see ifHDFSThe specified directory produces the expected file

D.

Oozieself-containedWebThe interface can also view the job execution results

Question 32

forFusionInsight HDplatformHBaseComponent, which properties of the secondary index need to be defined to add a secondary index? (multiple choice)

Options:

A.

index name

B.

index column

C.

index column type

D.

The name of the column family to which the indexed column belongs

Question 33

HDFSThe system time of the node where the client is located is the same as theFusionInsight HDThe system time of the cluster should be consistent. If there is a time difference, So the time difference should be less than a few minutes?

Options:

A.

5

B.

10

C.

20

D.

30

Question 34

rightHBase, which of the following properties can be set? (multiple choice)

Options:

A.

Versions

B.

TTL

C.

Coprocessor

D.

Compression

Question 35

existMapReduceIn the development framework,InputFormatWhat is the function of the class?

Options:

A.

mergemapThe result of the stage has the samekeyvalue key-value pair

B.

put allmapThe results of the stages are merged to get the final result

C.

rightmapThe results of the stages are sorted and merged

D.

Split the input data into individualsplit, and willsplitfurther split intoyes, then asmapletter input of numbers

Question 36

A project requires Internet access to a certain area? ?Save it, and search the full text of these Internet records to see if there is any? ?information, with

to prevent crime in the region.

In this scenario, which of the following options is the best?

Options:

A.

existSolrCreate an index and save the data, and return all the data during full-text search.

B.

existHBasestore data on theHBaseThe filtering characteristics of , satisfy fuzzy matching query.

C.

When storing data, inSolrindex on theHBaseStore complete data? ?when, throughSolrFull-text search to obtain and record

record key information, through the key information inHBasefor the full record.

Question 37

existStreamingin application development,BoltUse which of the following interfaces to sendTuple?

Options:

A.

to emit

B.

execute

C.

open

D.

nextTuple

Question 38

Fusionlnsigt HD which of the following components can be used for data collection? (multiple choice)

Options:

A.

Flume

B.

Loader

C.

HBase

D.

MapReduce

Question 39

aboutFusionInsight HDplatformHiveservice, itsWebHCatDevelopment interface, which of the following descriptions is incorrect?

Options:

A.

Support based onRESTquery request

B.

WebHCatThe return data format isSML

C.

WebHCatbased onHTTPandHTTPSAgreement to provide services to the outside world

D.

able to passWebHCatCreate tables, query, etc.

Question 40

Suppose there is an application that needs to be accessed frequentlyOracleThe user table in the database, in order to improve performance, introduceRedisto cache users

information.

For this scene,RedisWhich of the following is the best data structure choice for ?

Options:

A.

list

B.

set

C.

hash

D.

sorted set

Question 41

forSpakstreamingapplication, in aJVM, there can only be one at a time

Streamingcontextis active.

Options:

A.

True

B.

False

Question 42

HDFSmiddleNameNodeUsed to store metadata for the file system.

Options:

A.

True

B.

False

Question 43

FusionInsight HD which components are provided externallySQLor classSQLability? (multiple choice)

Options:

A.

HBase

B.

Hive

C.

Spark

D.

Redis

Question 44

HBasetablerowkeyDesign is a very important development and design link. Suppose there is the following scenario,

The most frequent query scenario is to query the historical call records of each month and half a year based on the mobile phone number. Which of the followingrowkey

Design is optimal?

Options:

A.

name ten mobile number

B.

date ten phone number

C.

Phone number+date

D.

Phone number+Name

Question 45

existBaseIn application development, when a table'sRowkeyRange and distribution are known, pre-score is recommendedregion,

Please call the following code (fragment) to pre-sort a tableregionAfter this table will create severalregion?

splits【0】=Bytes.toBytes("A");splits【1】=Bytes.toBytes("H");splits【2】

=Bytes.toBytes("O");splits【3】=Bytes.toBytes("U");admin.createTable(htd,splits);

Options:

A.

3

B.

4

C.

5

D.

6

Question 46

Which of the following components must depend onZooKeeperto run()?

(multiple choice)

Options:

A.

HDFS

B.

HBase

C.

Spark

D.

YARN(streaming)

Question 47

existSparkmiddle,SparkSQLis an independent module that does not depend onSparkCoretstandtasQaon.eaR. operation, etc.

Options:

A.

True

B.

False

Question 48

existHBaseWhile the application is running, the application can write data while creating the table.

Options:

A.

True

B.

False

Question 49

FusionInsightHD which components are provided externallySQLor classSQLability? (multiple choice)

Options:

A.

mouthHBase

B.

mouthHive

C.

mouthSpark

D.

mouthRedis

Question 50

FusiontnsightHD in which ways can you viewOozieDebug results of the job? (multiple choice)

Options:

A.

mouth viewOozieofJava APIWhether the returned result is as expected

B.

mouth throughHueofworkflowDashboard to view program running results

C.

mouth throughHue's file browser to see ifHDFSThe specified directory produces the expected file

D.

mouthOozieself-containedWebThe interface can also view the job execution results

Question 51

existFusionInsight HDcluster, aboutkinitOperation command, which of the following statements is wrong? (many select)

Options:

A.

Only use HMI account.

B.

Only the machine account can be used.

C.

A client does not support the simultaneous use of multiple accounts.

D.

The ticket obtained by executing this command is intwenty fourIt will time out when it is small and needs to be executed againkinitcommand to log in again.

Question 52

Regarding the Hive service of the FusionInsight HD platform and its WebHCat development interface, the following description does not

the correct one is

Options:

A.

Support for REST-based query requests

B.

The returned data format of WebHCat is XML

C.

WebHCat provides external services based on HTTP and HTTPS protocols

D.

You can create tables, query and other operations through WebHCat

Question 53

HBasedata fileHFileone ofKeyValueWhat information does the format contain?

(multiple choice)

Options:

A.

Key

B.

Value

C.

timeStamp

D.

KeyType

Question 54

existMapReduceIn application development of , which of the followingJAVAThe class is responsible for managing and running a Counting tasks?

Options:

A.

Job

B.

Context

C.

FileSystem

D.

Configuration

Question 55

existSolrWhen indexing, which of the following code snippets can significantly improve indexing performance? (multiple choice)

Options:

A.

mouthIbClient.setRequestWriter(new BinaryRequestWriterO);

B.

mouthif(updateRequest.getDocumentsMapO.size)>=10000){

cloudSolrClient.request(updateRequest,collName);updateRequest.clear);

C.

mouthif(updateRequest.getDocuments).size(O>=1000){

D.

cloudSolrClient.request(updateRequest,collName);updateRequest.clearO;

S8tW9H2xgntdc=new SolrInputDocument);

doc.addField("id",i.tostring( )doc.addField("name", "Zhang San4i);

doc.addField("features","test"+i):

doc.addreldc"price, (float)*1.oi);

cloudSolrClient.add(doc);

cloudSolrClient.commitO:

Question 56

HBaseWhich application scenarios are not suitable for?

Options:

A.

requires fullA, CIDFeature application scenarios

B.

Mass data application scenarios

C.

High throughput application scenarios

D.

Semi-structured data application scenarios

Question 57

TowardsHBaseincrease in the clusterRegionServerhost, the original cluster must be stopped first, becauseHBase Dynamic expansion is not supported.

Options:

A.

True

B.

False

Question 58

Options:

A.

The port preferentially downloads data from the nearest DataNode.

B.

The port file data is first returned from the DataNode to the NameNode, and then downloaded from the NameNode to the client.

C.

If the client and the connected DataNode fail while reading, the client will abandon the failure

node, to connect to the node where its replica is located.

D.

The port supports multiple clients reading the same file data from the DataNode at the same time.

Question 59

FusionInsight HDmiddle,OozieBefore submitting the job, you need to upload the configuration files that this job depends on andjarpackage toHDFS

Options:

A.

True

B.

False

Question 60

existMapReduceIn application development of , which of the followingJAVAThe class is responsible for managing and running a

Counting tasks?

Options:

A.

Job

B.

Context

C.

FileSystem

D.

Configuration

Question 61

Spark on YARNin modedriverOnly works on the client side.

Options:

A.

True

B.

False

Question 62

FusionInsight HDofHBasesave a user information table inmeg table, Rowkeyfor usersid,

One of the columns is the user's nickname, now write three to this column in orderKeyValue:001:Li,001:Mary,

001:Lily,Excuse mescan'meg table,{VERSIONS=>2}What data will be returned?

Options:

A.

001:Li

B.

001:Lily

C.

001:Li,001:Mary,001:LiLy

Question 63

which come downRedisIs the command an atomic operation?

Options:

A.

INCR(increment the stored digital value by one)

B.

HINCRBY(for the hash tablekeydomain infieldvalue plus incrementincrement

C.

LPUSH(for linked listspushoperate)

D.

All of the above

Question 64

FusionInsight HDin the system,FumeWhich component does the data flow not need to go through within the node?

Options:

A.

Source

B.

channel

C.

sink

D.

topic

Question 65

FusionInsight HDmiddle,StreamingPackaging tools are used tojarpackages and other dependencies jarpackage, etc., typed into a completeStreamingAppliedjarBag

Options:

A.

True

B.

False

Question 66

aboutFlumeThe characteristics of the collected data, which of the following descriptions are correct?

Options:

A.

Supports distributed and streaming acquisition

B.

with the help ofMapReducetask parallel collection

C.

Only structured data can be collected

D.

Trigger data collection with external timing tasks

Question 67

FusionInsight HDmiddle,StreamingWhat are the characteristics of? (multiple choice)

Options:

A.

stream processing

B.

high performance

C.

Scalable

D.

High fault tolerance

Question 68

FusionInsightHDin, aboutHivepartition (partition) function, which is wrong as described below?

Options:

A.

Partition fields are defined when the table is created

B.

There can only be one partition field, and multi-level partitions cannot be created

C.

Using partitions can reduce the data scan range of some queries, thereby improving query efficiency

D.

Partition fields can be used aswherecondition of the sentence

Question 69

Oozieofworkflow.xmlfile is the workflow configuration file

Options:

A.

True

B.

False

Question 70

HDFS"Write once, read many"__________File access model. Therefore, it is recommended that a document

Options:

A.

After creating,

B.

writing and closing,

C.

do not modify it again.

D.

right or wrong

Question 71

FusionInsight HDin useStreamingofA, CKWhich of the following statements is true

of? (multiple choice)

Options:

A.

After enabling Acker, Streaming will identify the failed Tuple and automatically resend it without human intervention pre.

B.

Acker will identify the processing timeout or processing failed messages as fail.

C.

Starting from the spout, failure of any link in the formed tuple tree will mark the entire tree as failure.

D.

The application needs to implement the message resend logic in the fail) interface method of spout.

Question 72

existSolrIn the application query scenario of the

Page. Which of the following statements about cursors is correct?

(multiple choice)

Options:

A.

The query field must be sorteduniqueKeySort by field

B.

For the first query request, you need to specify query parameters. The operation method is as follows:solrquery.set

(CursorMarkParams.CURSOR_MARK_PARAM,CursorMarkParams.CURSOR_MARK_

START);

C.

When querying, the request cannot containstartparameter

D.

SorlThe return result will havenextCursorMark, on the next call use this value ascursorMark

Question 73

aboutRedisFeatures, the following description is correct? (multiple choice)

Options:

A.

low latency

B.

Rich data structures

C.

Support data persistence

D.

supportSQLquery interface

Question 74

Hadoopin the systemYARNassigned toContainerThe memory size, which can be passed through the parameter

yarn.app.mapreduce.am.resource.mb(yarn.nodemanager.resource.memory-mb) to set.

( )

Options:

A.

True

B.

False

Question 75

Sparkis a memory-based computing engine, allSparkData during program operation can only be stored in

in memory.

Options:

A.

True

B.

False

Question 76

HDFSofClientWhen writing to a file, the first copy of the data is written to the location specified byNameNodeSure,

The other replicas are written to byDataNodeSure.

Options:

A.

True

B.

False

Question 77

FlumewriteHDFSWhen the file is generated, what are the ways of generating the file? (multiple choice)

Options:

A.

Generate files directly according to the source file name

B.

Set the file generation time interval, close the old file when the time interval is reached, and the new data will be written to the new file

C.

Set the size of a single file, when the file size reaches the set value, the old file will be closed, and the new data will be written to the new file

D.

Set up single file storageeventquantity, when the file is storedeventWhen the number reaches the set value, the old file will be closed, and the new data will be

is written to a new file

Question 78

FusionInsigt HDWhat distributed computing frameworks do big data platforms provide?

(multiple choice)

Options:

A.

HBase

B.

Sparkmemory-based distributed computing

C.

MapReduceDistributed offline batch computing engine

D.

StreamingDistributed stream computing engine

Page: 1 / 53
Total 526 questions