Buy Now and Pay in EMI's

STATISTICAL ANALYSIS USING R SOFTWARE

Abhiram Dash
  • Country of Origin:

  • Imprint:

    NIPA

  • eISBN:

    9789391383725

  • Binding:

    EBook

  • Number Of Pages:

    122

  • Language:

    English

Individual Price: 1,495.00 INR 1,345.50 INR + Tax

Add to cart Contact for Institutional Price
 

The book 'Statistical Analysis Using R software' deals with the fundamental concept of R software and the codes in R used for statistical analysis. It includes the development and content of R software, the concept of writing codes in R, importing of data from other software for use in R software. The codes in R used for statistical analysis purpose are mentioned in the mentioned and vividly explained. The book would definitely help the students and the personnel involved in teaching and research in understanding the concepts of R and its use for statistical analysis purpose.

0 Start Pages

Preface The book is designed for UG, PG and Ph.D students of all disciplines to gain fundamental knowledge on use of R software for statistical analysis. The book can also be helpful to the personnel engaged in teaching and research for gaining acquaintance with R software. The book provides fundamental concepts of R software and its application. The codes used in R mainly for statistical analysis purpose are vividly explained in the book. The syntax of various mathematical and statistical functions are mentioned alongwith the output obtained by the use of syntax. The concepts in the book have been analysed and presented in a simple and precise manner to the comprehension of the readers. I hope my endeavour would cater to the needs of the readers and render them immense benefit.

 
1 Introduction to R

R software is a programming language used for statistical analysis, graphical representation and reporting, and is currently developed by the R Development Core Team. R software is available freely under the GNU General Public License, and the binary versions are provided for the Windows, Linux and Mac operating systems. To develop R, effort from all over the world is made in a collaborative manner. What is GNU? GNU stands for GNU’s Not Unix. It is a Unix like computer operating system, but different from Unix. GNU consists of GNU packages which are program specific and released by GNU project. Free software means that the users of the software have freedom to run and use the software and also can copy, change, improve and distribute the software freely. Development of R R software was initially written by Robert Gentleman and Ross Ihaka also known as “R & R” of the statistics department of the University of Auckland, New Zealand. R appeared first in 1993. Since then various contribution to R has been made by a large group of talented individuals by sending code and bug reports. A core group, known as the “R Core Team” modify the R source code archive since 1997. Thus R is a collaborative project with many contributors.

1 - 8 (8 Pages)
INR167.00 INR151.00 + Tax
 
2 Objects, Functions and Operators in R

This chapter deals with the objects used in R which contains the data in various formats. The data types used in R has also been mentioned in this chapter. The chapter also includes the functions used in R as the codes for accomplishing several jobs related to data analysis. Unlike C language and Java, the variables in R are not declared as some data type. The variables are assigned with R object and the data type of R object becomes the data type of the variable. The commonly used R objects in R are vectors, lists, matrices, arrays, factors and data frames. The vector object is the simplest one and other R-objects are built upon the vectors. The different data types of these vectors are logical, numeric, integer, complex, character and raw. The objects in R are stored by the name of a variable and the data type of the object is determined by its content. To know the data type of a variable the class() function is used. The various data types are described as below:

9 - 36 (28 Pages)
INR167.00 INR151.00 + Tax
 
3 R Packages

R packages include functions, codes and sample data. The functions used in R are contained in packages. A package in R contains several functions which are related to some particular type of job. A package conatins at least one function and there is no maximum limit to the no. of functions in a package. The set of packages in R which are set by default in the software is called base package. It contains a lot of commonly used functions. The packages in R are stored under a directory called “library” in the R environment. The packages get installed by default while installing R in the computer. Also more packages can be added later on after installation, when required for some specific purposes. The base packages are available by default when we start using R. But the other packages which are already installed while installing R have to be loaded while using R. These packages can be viewed in the package window by clicking on package. To install the package we have to click on install menu in the package window and then write the name of the package. The packages can also be installed by using the function install.packages(“abc”) where abc is the name of the package to be installed. Once a package is installed, it remains in the hard disk of the PC/laptop. But the package will have to be called for use in the current R session. The package once installed can be loaded for use in the current R session by checking the box near the name of the package displayed after clicking on the install menu in the Files/ Plots/Packages/Help window. The package can also be loaded by using library() function. The command will be library(“abc”) where abc is the name of the package. The loaded package remains in the memory for the current R session. Once we quit the R session, the installed package has to loaded again if needed.

37 - 40 (4 Pages)
INR167.00 INR151.00 + Tax
 
4 Merging of Data in R

In R there is facility of reshaping the data. Reshaping of data includes joining of two or more vectors to create a data frame. Also two or more data frames can be merged. Appropriate functions are available for these jobs. While merging the data frames, we can join columns or rows. Illustration of joining two vectors by using cbind() function.

41 - 44 (4 Pages)
INR167.00 INR151.00 + Tax
 
5 Import and Export of Files in R

The most important use of R programming lies in importing of data files which serves as input for the analysis work. After the analysis is over by using R codes, operators and functions, the output result must also be exported in form of files. The files are imported in form of csv files (comma separated variable), text files or excel files. Before going to the actual content of the chapter we have to have an idea about csv files. CSV stands for Comma Separated value. It is used to store tabular data, such as a spreadsheet or database. CSV is delimited text file in which the values are separated by commas. CSV files occupy much less space than excel files and also gets imported in a program at a much faster rate than Excel files. CSV Files also does not manipulate data and stores as it is. Let us consider that an excel file named as ‘plant’ which stores three variables, named as ph, pd and yield is created inside the folder named as ‘My Data’ which is placed in the desktop. Thus the path of the file is “C:\\Desktop\\My Data\\plant” To import excel file the package will be readxl and the command will be library(readxl)

45 - 48 (4 Pages)
INR167.00 INR151.00 + Tax
 
6 Matrix operations in R

Matrix is a rectangular array of numbers arranged in rows and columns. The order of a matrix is written as r x c, where r is the no. of rows and c is the no. of columns. The numbers included in a matrix are called elements of the matrix. The name of the matrix is denoted by an uppercase English alphabet.

49 - 62 (14 Pages)
INR167.00 INR151.00 + Tax
 
7 Construction of Frequency Distribution by Using R

Frequency Distribution is the arrangement of various values of a variable in order of magnitude either individually or in groups alongwith their corresponding frequencies or class frequencies. Frequency distribution is necessary for data condensation and revealing pattern within the data. Frequency distribution can be (i) Ungrouped Frequency Distribution (ii) Grouped frequency distribution. In case ofungrouped frequency distribution, the individual values ofthe variable are arranged in ascending order alongwith their frequencies. Frequency is the number of times a particular observation is repeated. In case of grouped frequency distribution, the values of variable are grouped into classes alongwith their corresponding class frequencies. Class frequency of a class is the number of observations lying within the upper class limit (maximum value) and lower class limit (minimum value) of the class. Let us first illustrate the preparation of ungrouped frequency distribution. Consider 50 petridishes each containing 6 no.s of seeds for germination. Let the variable xi denote the no. of seeds germinated in a petridish.

63 - 66 (4 Pages)
INR167.00 INR151.00 + Tax
 
8 Diagrammatic and Graphical Presentation in R

Diagrams and graphs are the effective and attractive ways of presenting data. Widely used diagrams for data presentation are pi chart and bar diagrams. Bar diagrams are also of various types; (i) Simple bar diagram (ii) Component or sub-divided bar diagram (iii) Multiple bar diagram (iv) Percentage bar diagram. The widely used graphical presentation are histograms, frequency polygon and frequency curve. There are numerous libraries to create charts and graphs in R programming language. Pie-chart- A pie-chart is a representation of values of variable as sectors of a circle with different colours/shadings. The colours/shadings used for different sectors are indexed at the side of the graph. pie() function is used to create pie chrt in R. Let us create an exel file names as ‘exp_sports’ which contains the expenditure made (in Rs.) under various sports in a sports club. The name of the sports are cricket, hockey, tennis and football. The names of sports are placed under the column head ‘sports’. The corresponding expenditure (in Rs.) are 7200, 3600, 1800, 1800 and they are placed under the column head ‘expenditure’. The basic syntax for creating a pie-chart using the R is.

67 - 82 (16 Pages)
INR167.00 INR151.00 + Tax
 
9 Descriptive Statistics using R

The branch of statistics which describes the data is called descriptive statistics. After collection, the data are presented in form of tables, diagrams or graphs. This facilitates the further analysis of data. The most basic form of data analysis is descriptive analysis of data collected and presented for the study. The descriptive statistics summarises the data and helps to find the representative value of the entire data set and the variation of the other values in the data set around the representative value. The descriptive statistics includes the measures of central tendency or averages, measures of dispersion or scatteredness, measures of skewness and the measures of kurtosis. Before going to discuss the R codes for various measures of descriptive analysis, we would first discuss in brief about these measure. This will help in better understanding of the R codes. Measures of central tendency or averages: They are the representative values of the data series. They give an idea about the concentration of the values in the central part of the distribution of the data. There are five types of averages. They are 1. Arithmetic Mean (A.M.) 2. Geometric mean (G.M.) 3. Harmonic Mean (H.M.) 4. Median 5. Mode.

83 - 108 (26 Pages)
INR167.00 INR151.00 + Tax
 
10 End Pages

Annexures Annexure 1 Expenditure under different sports in a sports club

 
9cjbsk

Browse Subject

Payment Methods