New Year Special Limited Time Flat 70% Discount offer - Ends in 0d 00h 00m 00s - Coupon code: 70spcl

Huawei H13-711_V3.0 HCIA-Big Data V3.0 Exam Practice Test

Page: 1 / 65
Total 649 questions

HCIA-Big Data V3.0 Questions and Answers

Question 1

When the Fusioninsight HD cluster is deployed in a three-layer network, it is recommended that the management node, control node, and data node be installed in different network segments to improve reliability.

Options:

A.

True

B.

False

Question 2

The following aboutHB? ?The description of the storage type is correct?

Options:

A.

EveryKeyVa? ?have a Qualifier logo

B.

Even Eay?Same, Qualsfier also same multiple kary Valun. There may also be multiple values, in which case the time is used?to distinguish

C.

The same key value can be associated with multiple velues

D.

Keyvalise?There are key information such as timestamp, type, etC.

Question 3

Under the HDFS federation mechanism, metadata between NameNodes is not shared.

Options:

A.

True

B.

False

Question 4

The nodes in the ElasticSearch cluster are divided into master and slave.

Options:

A.

True

B.

False

Question 5

What is the default resource scheduler in YARN?

Options:

Question 6

Each stagel of a Spark task can be divided into jobs, and the division mark is shuffle.

Options:

A.

True

B.

False

Question 7

In the flume architecture, a Source can be connected to multiple channels.

Options:

A.

True

B.

False

Question 8

Spark on YARN-clienti is suitable for production environment because it can see the output of APP faster.

Options:

A.

True

B.

False

Question 9

The following are typical application scenarios of DWS

Options:

A.

Enterprise Data Warehouse

B.

Trading System

C.

data mart

D.

CRM/ERP

Question 10

ElasticSearch can be used as a relational database similar to MySQL.

Options:

A.

True

B.

False

Question 11

Tez is a distributed computing box that supports directed acyclic graphs. When Hive uses the Tez engine for data analysis, it parses the HQL statements submitted by users into corresponding Tez tasks and submits them to Tez for execution.

Options:

A.

True

B.

False

Question 12

In the Fusionlnisght HD system, the number of Partitions and the number of replicas must be set when creating the Topich of Kafkal. Setting up multiple replicas can enhance the disaster tolerance of the Kafka service.

Options:

A.

True

B.

False

Question 13

When Solr creates CollectionE, it is recommended to select the routing algorithm as compositld Router, then the Collection can expand shard.

Options:

A.

True

B.

False

Question 14

In Zookeeper's service model, the Leader node exists in the active-standby mode. All other nodes belong to the Follower node.

Options:

A.

True

B.

False

Question 15

A Kafka cluster contains one or more service instances, which are called( )

Options:

Question 16

Which of the following are the built-in string functions of Hive?

Options:

A.

trim()

B.

abs ()

C.

substr()

D.

length()

Question 17

a big data? ?Real-time users in processing statistics?data, the following?No?within a minute?data is grouped?What is the function of

Options:

A.

owea? ?

B.

? ? ?

C.

Tw? ?

D.

fre? ? ?

Question 18

The driver in Spark On YARN mode can only run on the client.

Options:

A.

True

B.

False

Question 19

In Streaming, exactly one message reliability level is achieved through the ACK mechanism.

Options:

A.

True

B.

False

Question 20

HDFSWhat are the benefits of abstract blocks?

Options:

A.

Suitable for data backup

B.

Support large-scale file storage

C.

Simplify system design

D.

satisfyI0Performance requirements for intensive applications

Question 21

Mapme?program by? ?consists of two parts,? ?program, which has?Piece?Task. Please program the most? ?banknotes? ?

Options:

A.

2

B.

3

C.

5

D.

4

Question 22

inWebHCatin the architecture. Users can use the secureHTTPSprotocolexecuteDo which of the following actions?

Options:

A.

implementHive DDLoperate

B.

runMap Reduce tasks

C.

runHiveHQL tasks

D.

all of the abovecorrect

Question 23

Fusioninsight tool is a set of health detection tools provided for technical support engineers and maintenance engineers. It can check the health status of cluster-related nodes and services, discover potential problems in the cluster in advance, and generate health check reports. It is convenient for technical support engineers and maintenance engineers to quickly understand the health status of the system.

Options:

A.

True

B.

False

Question 24

A Key Value format in the HBasel data file HFiler contains Key; Value, TimeStamp, KeyType, etC.

Options:

A.

True

B.

False

Question 25

If you want to add 1 to the digital value stored in the Key, which command should you use?

Options:

Question 26

The following aboutHiveComponent capabilities in the architecture. Which is the correct description?

Options:

A.

ThriftServer for thriftcatchmouth, as JDBCservices and willHive and other applicationsintegrated

B.

Compiler pressAccording tomissionaccording toDependent relationship is executed separatelyMap/Raduce task

C.

Executor is responsible for editingTranslate voQLandwillwhich translates into a series of mutualrelyofMap/ReduceTask

D.

Ooptimizer is an optimizer, divided into logicalJiyouoptimizer and physical optimizer.HiveQLGenerated execution planMapRadceTaskEnterrow optimization

Question 27

In the Loader of Fusioninsight HD, one connector can only be assigned to one joB.

Options:

A.

True

B.

False

Question 28

The HDFS data reading process includes the following steps, please choose the correct order. (Drag picture title, sort question)

Options:

A.

After obtaining this input stream, the client calls the read method to read the data. The input stream selects the nearest DataNode to establish a connection and read data.

B.

The client calls close. to close the input stream.

C.

The location where the data block corresponding to this file in NameNodel is obtained by calling NameNode remotely through RPC

D.

If the end of the data block has been reached. Then close the connection with this DataNodel, and then re-find E. a data block. until all data is read.

E.

The client calls the open method of the FileSystem instance to obtain the input stream corresponding to the file.

Question 29

Similar to Spark Streaming, Flink is an event-driven real-time streaming system

Options:

A.

True

B.

False

Question 30

Which of the following operations cannot be recorded in the Fusioninsight HD system audit log?

Options:

Question 31

The client is a user operationHDFSThe most common way. the following aboutHDFSThe description of the client issureyeswhich?

Options:

A.

customerendis HDFSpart of isDeploy HDFSmustGrouppiece

B.

HDFS guestThe client provides something likeshe 11'sOrderVisit by wayAsk HDFSnumber inaccording to

C.

HDFSThe client is a library,IncludeHDFS filePartssysteminterface, thesecatchmouth hiddenHDFSMost of the complex in the implementationmiscellaneoussex

D.

customerendcan support opening,Common operations such as read and write

Question 32

Elastic SearchofSowleadCanbystoragein a variety ofstorage classtype,andthe followingwherekind of storagekindtypebranchhold?

Options:

A.

Shared file system

B.

Object Storage

C.

HDFS

D.

Local file system

Question 33

Hadoop's NameNode is used to store the metadata of the file system.

Options:

A.

True

B.

False

Question 34

In Kafka HA, when the leader corresponding to the partition is down, a new leader needs to be elected from the followers. Which of the following roles should be executed?

Options:

A.

Follower

B.

Controller

C.

Brocker

D.

Leader

Question 35

In the Output stage, Structured Streaming can define different data writing methods, including which of the following methods?

AAppend Mode

B. Update Mode

C. General Mode

D. Ccomplete Mode

Options:

Question 36

Which of the following contents can be viewed in the Loader historical job record?

Options:

A.

job status

B.

Job start/run time

C.

dirty data link

D.

Error lines/number of files

Question 37

Regarding the task selection of the capacity scheduler, which of the following statements is true( )

Options:

A.

Minimum queue level first

B.

Resource recycling request queue priority

C.

Maximum queue level first

D.

The queue with the lowest resource utilization takes precedence

Question 38

In the FusininsightHD platform, which components support list encryption?

Options:

A.

HDFS

B.

Flink

C.

HBase

D.

Hive

Question 39

The following figure shows the label storage strategy of HDFS. Observe the figure below, which data nodes will HBasel data be stored on

Options:

A.

DataNode A

B.

DataNode B

C.

DataNode E

D.

DataNode F

Question 40

The following figure shows the computational model of Structured Streaming. By observation, it can be concluded that the final calculation result of 3 is

Options:

A.

Dog 1, owl 1

B.

Cat 2, dog 4, owl 2

C.

Cat 2, dog 3, owl 1

D.

Cat 1, cat 1, dog 2, dog 2, owl 2

Question 41

Which of the following descriptions about the functions of HMaster in HBase are correct?

Options:

A.

Region load balancing, Region splitting and Region allocation after splitting

B.

Responsible for creating tables, modifying tables, deleting tables

C.

Responsible for load balancing of RegionServer

D.

Region after RegionServer failsmigrate

Question 42

Which parts of the data need to be read to execute the HBase data reading business?

Options:

A.

HLog

B.

MemStore

C.

HFile

D.

HMaster

Question 43

Which of the following are the functions that Spark can provide?

Options:

A.

Distributed memory computing engine

B.

Distributed file system

C.

Unified scheduling of cluster resources

D.

Stream processing capabilities

Question 44

What configuration files can the Fusioninsight HD LLD configuration planning tool generate?

Options:

A.

Monitoring Alarm Threshold Profiles

B.

Cluster installation template file

C.

Configuration files for HDFS and YARN

D.

Configuration file Check required to execute Precheck Nodes.Config

Question 45

Which of the following scenarios is not applicable to Hive?

Options:

A.

Real-time online data analysis

B.

Non-real-time analysis, such as log analysis, statistical analysis

C.

Data mining, such as user behavior analysis, interest division, regional display

D.

Data aggregation, such as daily, weekly user clicks,Click to rank

Question 46

In Huawei's big data solution, LadpServer, as a directory service system, can implement centralized management of accounts on the big data platform. The following statement about LdapServer is correct:

Options:

A.

LdapServer supports TCP/P protocol.

B.

LdapServer is a specific open source implementation based on the LDAP standard protocol.

C.

LdapServerl uses Berkelay DB as the default backend database.

D.

LdapServer is implemented based on OpenLDAP open source technology

Question 47

This command in Hive "ALTER TABLEemployeelADDcolumns(columnlstring);"What does it mean?

Options:

A.

delete table

B.

add column

C.

create table

D.

Modify file format

Question 48

In the era of big data, which of the following challenges are faced by enterprises?

Options:

A.

Data is scattered among various departments of the enterprise, and the same data is stored in different formats within each department.

B.

Diversified data structures.

C.

technological advancements of competitors.

D.

Scattered data has problems such as noise, missing, and non-standard storage types, which requires a lot of data preprocessing.

Question 49

Starting from version 2.7.3 of HDFS, what is the default Block Size?

Options:

A.

32MB

B.

128MB

C.

64MB

D.

16MB

Question 50

Which of the following options does the time operation type supported by F1ink include?

Options:

A.

End Time

B.

processing time

C.

Acquisition time

D.

event time

Question 51

Which of the descriptions of the Loader job in FusionlnsightHD is correct?

Options:

A.

After the Loader submits the job to Yam for execution, if the Loader service is abnormal at this time, the job execution fails.

B.

After the Loader submits the job to Yam for execution, if a Mapper task fails to execute, it can automatically retry

C.

After the Loadet job fails to execute, garbage data will be generated, which needs to be cleared manually by the user

D.

Loader submits a job to Yam for execution. No other jobs can be submitted until the job is executed.

Question 52

What are the successful cases of Huawei Fusioninsight HD in the industry?

Options:

A.

digital government

B.

Smart Park

C.

smart transportation

D.

finance

Question 53

Which of the following indicators belong to flume data monitoring?

Options:

A.

The amount of data received by the source

B.

The amount of data written by the sink

C.

Number of DataNodes

D.

The amount of cached data in the channel

Question 54

In the HDFS federated environment, which of the following contents are included in the NameSpace

Options:

A.

content

B.

document

C.

piece

D.

None of the above

Question 55

In the FusionInsight cluster, which of the following components does Spark mainly interact with?

Options:

A.

Hive

B.

YARN

C.

HDFS

D.

Zookeeper

Question 56

What methods or interfaces does Loader provide to implement job management?

Options:

A.

WEB UI

B.

Linuxt command line

C.

REST interface

D.

Java API

Question 57

When installing a Fusionlnsight HD cluster in safe mode, which components must be installed?

Options:

A.

Zookeeper

B.

LDAPServer

C.

KrbServer

D.

HDFS

Question 58

In a Huawei Fusioninsight HD cluster, which of the following services can the Spark service read data from?

Options:

A.

YARN

B.

HDFS

C.

Hive

D.

HBase

Question 59

In the Fusioninsight product, which statement is correct about the Kafka component?

Options:

A.

When creating a topic, the number of replicas must not be greater than the number of currently surviving Broker instances, otherwise the topic creation will fail

B.

When the Producer of Kafkal sends a message, it can specify which Consumer consumes the message

C.

Kafka will store metadata information in Zookeeper for

D.

After Kafka is installed, the sensitive data storage directory cannot be configured.

Question 60

When executing the HBases data reading business, which parts of the data need to be read?

Options:

A.

HFile

B.

HLog

C.

MemStore

D.

HMaster

Question 61

Which of the following descriptions about the characteristics of Kafka Partition replicas is correct?

Options:

A.

Follower synchronizes data from Leader by pulling

B.

The master copy is called Leader, and the slave copy is called Follower

C.

Both consumers and producers read and write data from the Leader, and can also interact directly with the Follower

D.

Replicas are in units of partitions. Each partition has its own slave replica of the master

Question 62

Are the following descriptions correct about the HBase file storage module (HBase FileStream, HFS for short)?

Options:

A.

Applied in the upper layer of Fusioninsight HD

B.

HFS encapsulates the interface between HBase and HDFS

C.

Provide functions such as file storage, reading, and deletion for upper-layer applications

D.

HFS is a separate module of HBase

Question 63

After submitting the topology using the Streaming client shell command in the Fusioninsight HD system, use Strom The UI view shows that the topology has not processed data for a long time. What are the possible reasons?

Options:

A.

Supervisor is the component that receives data in topology and then performs processing

B.

There is a logic error in the topology business, and it cannot run normally after submission

C.

The topology is too complex or the number of concurrent users is too large, resulting in workerThe startup time is too long, exceeding the waiting time of Supervisort

D.

The supervisor's slots resources are exhausted, and after the topology is submitted, the slots cannot be allocated to start the worker process.

Question 64

What are the main features of HBase?

Options:

A.

High reliability

B.

high performance

C.

column oriented

D.

adjustable

Question 65

Regarding Flume, which of the following statements is false?

Options:

A.

Data transfer between Flume cascade nodes supports encryption

B.

Flume supports multi-cascading and multiplexing

C.

There is a need for encryption inside processes such as Source to Channel to Sink

D.

Data transfer between Flume cascade nodes does not support compression

Question 66

The ZKFC process is deployed on the following node in HDFS?

Options:

A.

Active NameNode

B.

Standby NameNode

C.

DataNode

D.

All of the above are wrong

Question 67

Which of the following options does the main role of Zookeeper in distributed applications not include?

Options:

A.

Election of Master Nodes

B.

Ensure data consistency on each node

C.

Allocate cluster resources

D.

Store server information in the cluster

Question 68

What is the resource management framework that comes with Spark?

Options:

A.

Standalone

B.

Mesos

C.

YARN

D.

Docker

Question 69

When the loader in Fusioninsight HD imports files from the SFTP server, which of the following file types does not require encoding conversion and data conversion and is the fastest?

Options:

A.

sequence_file

B.

text_file

Cbinary_file

C.

graph_file

Question 70

What does Fusioninsight HD HBase use by default as its underlying file storage system?

Options:

A.

HDFS

B.

Hadoop

C.

Memory

D.

MapReduce

Question 71

Which of the supervisor descriptions of Fusioninsight HD Streaming is correct?

Options:

A.

Supervisor is responsible for resource allocation and task scheduling

B.

Supervisort is responsible for accepting tasks assigned by Nimbus, starting and stopping Worker processes that belong to its own management

C.

Supervisor is a process that runs specific processing logic

D.

Supervisor is a component that receives data in Topology and then performs processing

Question 72

Which of the following factors contributed to the vigorous development of the era of big data?

Options:

A.

Reduced hardware costs and increased network bandwidth

B.

The rise of cloud computing

C.

The popularization of smart terminals and the improvement of social demands

D.

all of the aboveA. True

Question 73

Which of the following is not a core element of KrbServer?

Options:

A.

KDC (Key Distribution Center)

B.

Kerberos Client

C.

Kerberos KDC Client

D.

Kerberos KDC Server

Question 74

The picture below shows Sparke&MapReduce performance comparison data, it can be concluded that compared with MapReducei computing, Spark uses( )resources, get( )double the performance?

Options:

A.

1/8, 3

B.

1/10, 3

C.

1/10, 4

D.

1/8, 4

Question 75

Which of the following descriptions about HBase. Secondary Index is correct

Options:

A.

The secondary index associates the column to be searched with the rowkey into an index table

B.

At this point, it is listed as a new rowkey; the original rowkey becomes the value

C.

The secondary index is queried twice

D.

all of the above

Question 76

In the Fusioninsight HD product, which statement about Kafka is incorrect?

Options:

A.

Kafka strongly depends on Zookeeper

B.

The number of instances deployed by Kafka must not be less than 2

C.

Kafkal server can generate messages

D.

Consumer consumes messages as the client role of Kafkal

Question 77

Which configuration is not supported by Fusioninsight Manager user rights management?

Options:

A.

Assign roles to users

B.

Configure permissions for roles

C.

Assign roles to user groups

D.

Configure permissions for user groups

Question 78

In the MRS platform, which component does the F1ume data flow not need to pass through in the node?

Options:

A.

Sink

B.

Channel

C.

LTopic

D.

Source

Question 79

To set the maximum resource usage of QueueA in YARN, which parameter needs to be configured?

Options:

A.

yarn.scheduler.capacity.root.QueueA. user-limit-factor

B.

yarn.scheduler.capacity.root.QueueA. minimum-user-limit-percent

C.

yarn.scheduler.capacity.root.QueueA. state

D.

yarn.scheduler.capacity.root.QueueA. maximum-capacity

Question 80

Which of the following statements about Flink barriers is wrong

Options:

A.

Barriers are periodically inserted into the data flow and flow with it as part of the data flow

B.

Barriers are at the heart of Flink snapshots

C.

A barrier separates the snapshot data of the current cycle from the snapshot data of the next cycle

D.

When the barrier is inserted, it will temporarily block the data flow

Question 81

The underlying data of HBase exists in the form of?

Options:

A.

keyvalue

B.

column store

C.

row storage

D.

real-time storage

Question 82

In Hive, which of the following statements about partitions is incorrect

Options:

A.

There can be further partitions or buckets under the partition

B.

The data table can be partitioned by the value of a field

C.

Each partition is a directory

D.

The number of partitions is fixed

Question 83

As shown in the figure, the following description of the message read by the Kafka message consumer Consumeri is wrong?

Options:

A.

The blue in the picture is a topic of Kafkal, which can be understood as a queue, and each grid represents a message.

B.

The messages generated by the producer are placed at the end of the topic one by one.

C.

Consumers read messages sequentially from right to left.

D.

Consumert uses offset to record the position of the read

Question 84

Which of the following types of data is not semi-structured data?

Options:

A.

HTML

B.

XML

C.

two-dimensional table

D.

JSON

Question 85

In the F1ink technical architecture,( )is a computing engine for stream processing and batch processing

Options:

A.

Standalone

B.

Runtime

C.

DataStream

D.

FlinkCore

Question 86

In the Fusioninsight HD system, which component does the flume data flow not need to pass through in the node?

Options:

A.

sink

B.

topic

C.

Source

D.

Channel

Question 87

Which of the following options are suitable for MapReduce?

Options:

A.

Offline computing

B.

real-time interactive computing

C.

Iterative calculation

D.

Streaming Computing

Question 88

What is the physical storage unit of Region in HBasel

Options:

A.

Region

B.

ColumnFamily

C.

olumn

D.

Row

Question 89

Which of the following commands downloads directory files from HDFS to local?

Options:

A.

dfs-cat

B.

dfs -mkdir

C.

dfs -get

D.

dfs-put

Question 90

Which of the following descriptions about the basic operations of Hive SQL is correct?

Options:

A.

When loading data into Hive, the source data must be a path in HDFS

B.

To create an external table, you must specify location information

C.

Column delimiters can be specified when creating a table

D.

Create an external table using the external keyword. To create a normal table, you need to specify the internal keyword

Question 91

Fusioninsight Hadoop:In the cluster, query through df-hT on a node. The partitions you see include the following: /var/log/srv/BigData/srv/BigData/hadoop/data5/srv/BigData/solr/solrserver3/What is the optimal RAID level planning combination for the disks corresponding to the srv/BigData/dbdadom partitions?

Options:

A.

Raido Raid1 Raido Non-Raidl

B.

Raidl Raid1 Non-RaidNon-Raid Raidl

C.

Raido raido Raido Raido raid0

D.

Non-Raid Non-Raid Non-Raid Non-Raid Raid1

Question 92

Which module is responsible for Fusioninsight Manager user data storage?

Options:

A.

CAS

B.

AOS

C.

Kerberos

D.

LDAP

Question 93

Kafka cluster during runtime,Directly depend on the following components?

Options:

A.

Zookeeper

B.

HDFS

C.

Spark

D.

HBase

Question 94

Which of the following options belong to Hive's data storage model?

Options:

A.

surface

B.

bucket

C.

database

D.

partition

E.

all of the above

Question 95

When the Loader of MRS creates a job, what is the role of the connector?

Options:

A.

Configure how jobs connect to external data sources

B.

Configure how jobs connect to internal data sources

C.

Provide optimization parameters to improve data import and export performance

D.

Make sure there are conversion steps

Question 96

When creating a Loader job, in which of the following steps can the number of Maps be set?

Options:

A.

output

B.

input settings

C.

convert

D.

Basic Information

Question 97

How is the HBaseM master elected?

Options:

A.

Randomly selected

B.

Adjudicated by RegionServer

C.

Adjudication via Zookeeper

D.

HMaster is a dual-master mode and does not need to be adjudicated

Page: 1 / 65
Total 649 questions