Apache Hive Data Types Hive is Data warehousing tool and used to process the data stored in hadoop and HDFS. Hive is similar to SQL because it analyze and process the data through querying language. In this article we are discuss about basic data types for Hive query processing. Hive Data Types are classified into four types, given as follows Column Types Literal Null Values Complex Types Column Types: 1. Integral Integer type data can be used to Integral data types. Integral data types mentioned as INT. There are four types INT data types TINYINT (1-byte signed integer, from -128 to 127) SMALLINT (2-byte signed integer, from -32,768 to 32,767) INT (4-byte signed integer, from -2,147,483,648 to 2,147,483,647) BIGINT (8-byte signed integer, from -9,223,372,036,854,775,808 to 9,223,372,036,854,775,807) 2. String Types: String Data Types mentioned by Single Quotes and Double Quotes. It contains two types – CHAR, VARCHAR CHAR – CHAR is the f...
Popular posts from this blog
CAP Theorem in Hadoop What is CAP Theorem? CAP theorem is designed for distributed file systems(collection of interconnected nodes).CAP Theorem also known as Brewer’s theorem and used to distributed consistency.It contains follwing three technical terms for distributed systems. C – Consistency A – Availability P – Partition Tolerance Consistency: When you read data it will give same data how many times read and server send response each and every request but systems always consistent when read data.(all node having same data) Availability: It means all requests give response and no error accured in this systems. Partition Tolerance: All functions run all time when more nodes not responsive and commnication break between two nodes Distributed systems statisfy any two terms only and not satisfy three terms Selecting Two options in CAP Theorem: CP – Consistency/Partition Tolerance: It wait for response form partioned nodes and that ...
Apache Hadoop Oozie Tutorial Introduction: Oozie is mainly used to manages the hadoop jobs in HDFS and it combines the multiple jobs in particular order to achieve the big task. It is the open source framework and used to make multiple hadoop jobs. Oozie supports the jobs in mapreduce,hive and hdfs also. In Oozie job workflow based on Directed Acylic Graph and it contains two nodes for managing the jobs that nodes are action and control flow nodes. Advantages of Oozie is it integrate with hadoop stack and also support mapreduce and hdfs jobs. Oozie contains following three types of jobs 1. Workflow jobs – It used to represents the sequence of jobs executed. 2. Coordinator Jobs – It contains workflow jobs and it triggered by time 3. Bundle Jobs – It contains the workflow and coordinator jobs Types of Nodes in Apache Oozie: Action Node – It represents the workflow jobs and jobs program are written in java Control F...
Comments
Post a Comment