A Gentle Introduction to Statistics Using SAS Studio in the Cloud
eBook - ePub

A Gentle Introduction to Statistics Using SAS Studio in the Cloud

  1. 306 pages
  2. English
  3. ePUB (mobile friendly)
  4. Available on iOS & Android
eBook - ePub

A Gentle Introduction to Statistics Using SAS Studio in the Cloud

About this book

Point and click your way to performing statistics!

Many people are intimidated by learning statistics, but A Gentle Introduction to Statistics Using SAS Studio in the Cloud is here to help. Whether you need to perform statistical analysis for a project or, perhaps, for a course in education, psychology, sociology, economics, or any other field that requires basic statistical skills, this book teaches the fundamentals of statistics, from designing your experiment through calculating logistic regressions. Serving as an introduction to many common statistical tests and principles, it explains concepts in an intuitive way with little math and very few formulas.The book is full of examples demonstrating the use of SAS Studio's easy point-and-click interface accessed with SAS OnDemand for Academics, an online delivery platform for teaching and learning statistical analysis that provides free access to SAS software via the cloud.

Topics included in this book are:

  • How to access SAS OnDemand for Academics
  • Descriptive statistics
  • One-sample tests
  • T tests (for independent or paired samples)
  • One-way analysis of variance (ANOVA)
  • N-way ANOVA
  • Correlation analysis
  • Simple and multiple linear regression
  • Binary logistic regression
  • Categorical data, including two-way tables and chi-square
  • Power and sample size calculations

Questions are provided to test your knowledge and practice your skills.

Trusted by 375,005 students

Access to over 1.5 million titles for a fair monthly price.

Study more efficiently using our study tools.

Information

Publisher
SAS Institute
Year
2021
Print ISBN
9781954844452
eBook ISBN
9781954844476

14

Analyzing Categorical Data

Introduction

One of the data analysis tasks that you will be using quite often is to compute frequencies on categorical data and to analyze two-way frequency tables (tables with frequencies of one variable making up rows of the table and frequencies of another variable making up columns of the table). You might want to see survey results showing the number and percentage of subjects in one or more categories. Another common statistical test performed on categorical data is to see whether two categorical variables are related. For example, are people with high cholesterol more likely to suffer a heart attack compared to people with normal cholesterol levels? On average, do women earn less than men based on a sample of weekly salaries? This chapter covers two of the SAS Studio statistical tasks that deal with frequency data: the first is called One-Way Frequencies, the other, Table Analysis.

Describing the Salary Data Set

One of the data sets created when you ran the Create_Datasets.sas program is called Salary, and it is placed in the STATS library. This data set contains simulated data on weekly salaries broken down by gender, age group, and educational level. Besides the actual weekly salary (variable Weekly_Salary), there is another numeric variable (Salary) that has values of 0 (salary below the median) and 1 (salary above the median). Although this data set was simulated, the values were based on a data set from the U.S Department of Labor. The Department of Labor data contains salaries for multiple categories: age group, level of education, gender, and several other variables. For simplicity, the Salary data set contains only two levels for most of these variables (typically ones with extreme differences, such as ages 20–24 compared to ages 45–54, where large differences in salaries exist).
Before we get into the actual analysis of this data set, it is important to understand how these statistics are collected. One method, called uncontrolled, simply looks at median salaries for each category of predictor, such as gender or age group. The other, called controlled, looks at salaries for identical jobs and other factors such as years of employment. All of the data values for this simulated data set are based on the uncontrolled labor statistics values.
Variables in the Salary data set are displayed in the table that follows.
Variable
Description
Values
Weekly_Salary
Weekly salary
Actual amount in dollars
Salary
Above or below the median salary
0 = Below, 1 = Above
Gender
Gender
M = Male, F = Female
Age_Group
Age group
20–24, 45–54
Education
Educational level
Less than HS, College Degree or higher

Computing One-Way Frequencies

To compute one-way frequencies (frequencies for a single variable), go to the Statistics tab under Tasks to see a list of statistical tasks (Figure 14.1).
Figure 14.1: Demonstrating the One-Way Frequency Task
Double-click One-Way Frequencies in the Statistics task list to bring up the following screen (Figure 14.2).
Figure 14.2: DATA Tab Selections
On the DATA tab, choose the Salary data set stored in the permanent STATS library. Next, select the variables Salary, Gender, Age_Group, and Education in the Analysis variables box. Notice that the variable Weekly_Salary (which is a numeric variable) is included in your list of choices. If you include numeric variables with many different values (such as Weekly_Salary), the One_Way Frequency task will list frequencies for every unique value of the selected variable. If you included the variable Weekly_Salary as an analysis variable, it would tell you how many people earned $800, how many people earned $801 dollars, and so on. The only numeric variable chosen in this example is Salary. This is OK because this numeric variable is coded as 0 or 1, representing below the median and above the median.
Before you run the procedure, click the OPTIONS tab to select additional options.
Figure 14.3: One-Way Frequency Options
In this example, you have chosen to suppress plots and to deselect the default option to include cumulative frequencies (which you rarely need). You are now ready to run the procedure. The output is shown in Figure 14.4.
Figure 14.4: Frequency Tables
You see the frequency and percent for each unique value of these variables. Although this is useful information, it could be improved by replacing the values of Gender (F and M) with the labels Male and Female and replacing the values of Salary (0 and 1) with the labels Below the Median and Above the Median...

Table of contents

  1. Cover
  2. Title Page
  3. Copyright
  4. Table Of Contents
  5. About this Book
  6. About the Author
  7. Acknowledgments
  8. Descriptive and Inferential Statistics
  9. Study Designs
  10. What Is SAS OnDemand for Academics?
  11. SAS Studio Tasks
  12. Importing Data into SAS
  13. Descriptive Statistics – Univariate Analysis
  14. One-Sample Tests
  15. Two-Sample Tests
  16. Comparing More Than Two Means (ANOVA)
  17. N-Way ANOVA
  18. Correlation
  19. Simple and Multiple Regression
  20. Binary Logistic Regression
  21. Analyzing Categorical Data
  22. Computing Power and Sample Size
  23. Solutions to the Odd-Numbered Exercises
  24. Other Resources

Frequently asked questions

Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription
No, books cannot be downloaded as external files, such as PDFs, for use outside of Perlego. However, you can download books within the Perlego app for offline reading on mobile or tablet. Learn how to download books offline
Perlego offers two plans: Essential and Complete
  • Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
  • Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.5M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Both plans are available with monthly, semester, or annual billing cycles.
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1.5 million books across 990+ topics, we’ve got you covered! Learn about our mission
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more about Read Aloud
Yes! You can use the Perlego app on both iOS and Android devices to read anytime, anywhere — even offline. Perfect for commutes or when you’re on the go.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app
Yes, you can access A Gentle Introduction to Statistics Using SAS Studio in the Cloud by Ron Cody in PDF and/or ePUB format, as well as other popular books in Computer Science & Programming. We have over 1.5 million books available in our catalogue for you to explore.