Statistics homework help

This week’s readings discuss conditional probabilities, conditional odds, logits, odds ratios, relative risk, and slopes. These can all be confusing terms but the good news is that all these values have some relationship to each other. Researchers have their own opinions on which values makes the most sense to report.

Discussion 

In a 2- to 3-paragraph post, construct a persuasive argument for the value (conditional probability, odds, odds ratio, etc.) that, intuitively, makes the most sense for you to report as a result to your audience. Be sure to provide a specific rationale for your choice.

Statistics homework help

Research Question:
As the Director of Academic Assessment at City University (CU) you are asked to determine if the SAT test is a significant predictor of persistence of first year of college students.  The sample consisted of the 100 freshmen students that enrolled in CU as first-time full-time students in the Fall 2019 semester.
Because the SAT scores for incoming class of students as significantly and severely skewed – thus violating the assumption of normality – you dichotomized the students into two SAT groups coded zero and one.  Students that scores below the median score of 1000 on the SAT were given a code of 0, and classified as the Low SAT students.   Students that scored 1000, or above,  on the SAT were given a code of 1, and classified as the High SAT students.
The outcome measure (DV) was year to year persistence. Specifically, whether or not the student re-enrolled as a Full-time student in the Fall 0220 semester.  Those students that dropped out of college and didn’t enroll in class during the Fall 2020 semester were assigned a code of 0 on the persistence measure.  Students that enrolled full-time in the Fall of 2020, the “persisters”, were assigned a code of 1 on the persistence measure.
Analyze the data to determine if SAT classification (High or Low) is significantly related to persistence
Using the attached SPSS dataset construct a 2×2 contingency table using crosstabs, and then conduct a simple binary logistical regression analysis in which the variable Persist of the Dependent variable, and the variable SAT is the Covariate (predictor) variable
Be sure to present and discuss the significance of the overall analysis (model).  Be sure to present and discuss:

  • Conditional probabilities
  • Conditional odds
  • Logits
  • Odds ratios
  • Relative risk
  • Slope
  • Intercept
  • the results for the overall model (Chi-square test and R-square measures)
  • The results for Wald’s test.

Also be sure to include the null and alternative hypotheses for the analoysis.
You summary should be 2 to 3 pages.  Please use my sample summary as a model-for your summary.

  • attachment

    Week4SATMandPersistence.sav
  • attachment

    APASumaryforCancerTreatmentStudyforORs_1_1.docx

Statistics homework help

Provide (2) 200 words response with a minimum of 1 APA references for RESPONSES 1 AND 2 below. Responses may include direct questions. In your peer posts, compare the probabilities that you found with those of your classmates. Were they higher/lower and why? In your responses, refer to the specific data from your classmates’ posts. Make sure you include your data set in your initial post as well. Attached are the excel docs for both responses.
RESPONSE 1:
For this weeks forum I determined that my average price for the SUV’s that I selected was $49,903.60. Out of the 10 vehicles that I chose 6 of the cars were less than the average of all the cars price giving me a probability of p=.60 and a q=.40. When I calculated the probability of 10 randomly selected cars that exactly 4 of them will fall below my average I got 11%. When I calculated the probability that fewer than 5 of them would fall below the average I got slightly over 16%. Next, I calculated the probability that more than 6 of them would fall below my average and got 38%. What was interesting is that when I calculated the probability that at least 4 cars would fall below the average of my vehicles I got 94.5%. I think that this is a good exercise to learn in being able to speak to the likelihood that a car dealer will likely have the vehicle a customer is looking for based on the customer’s price point. If the dealer knows their averages they can also make suggestions and marketing assumptions that they have a good probability that a customer will find a vehicle of their choice within their price range using the information we covered in this weeks lesson. The formulas based on the pdf were explained at the bottom of the attached spreadsheet.
RESPONSE 2:
The average of my vehicles came out to $16,593. Half of the car’s prices fell below the average which made my probability success and failure 0.5. Both p=0.5 and q=0.5.
Average = $16,593
5 of my vehicles fall below the average
P= 5/10
P or Success = 0.5
Q = 1-p
Q = 1-0.5
Q or Failure = 0.5
In another random sample using the same data, the probability that exactly 4 cars would fall below the average is 21%.
In another random sample using the same data, the probability that fewer than 5 vehicles would fall below the average is 62%.
In another random sample using the same data, the probability that more than 6 vehicles will fall below the average is 17%.
In another random sample using the same data, the probability that at least 4 vehicles will fall below the average is 83%.
The results did not particularly surprise me. Since my probability was 0.5 is made it easier to simply look at the numbers and make a guess. The result that probably surprised me the most was how little the probability for exactly 4 vehicles falling below the average was. But then again, anytime you see the word “exactly” I would imagine a lower probability. Another extremely helpful pdf for this exercise. It was easy to follow and helped me understand the material more clearly.
Jenny
  • attachment

    Response1.xlsx
  • attachment

    Response2.xlsx

Statistics homework help

Please follow Rubric in addition to answer the question on the word doc.
  • attachment

    Week4ProjectRubric.pdf
  • attachment

    STAT3001Week4Project_2_.docx

Numerical analysis homework help

Deliverable 6 – Analysis with Correlation and Regression

Competency

Determine and interpret the linear correlation coefficient, and use linear regression to find a best fit line for a scatter plot of the data and make predictions.

Scenario

According to the U.S. Geological Survey (USGS), the probability of a magnitude 6.7 or greater earthquake in the Greater Bay Area is 63%, about 2 out of 3, in the next 30 years. In April 2008, scientists and engineers released a new earthquake forecast for the State of California called the Uniform California Earthquake Rupture Forecast (UCERF).
As a junior analyst at the USGS, you are tasked to determine whether there is sufficient evidence to support the claim of a linear correlation between the magnitudes and depths from the earthquakes. Your deliverables will be a PowerPoint presentation you will create summarizing your findings and an excel document to show your work.
Concepts Being Studied

  • Correlation and regression
  • Creating scatterplots
  • Constructing and interpreting a Hypothesis Test for Correlation using r as the test statistic

You are given a spreadsheet Click for more options
that contains the following information:

  • Magnitude measured on the Richter scale
  • Depth in km

Using the spreadsheet, you will answer the problems below in a PowerPoint presentation.

What to Submit

The PowerPoint presentation should answer and explain the following questions based on the spreadsheet provided above.

  • Slide 1: Title slide
  • Slide 2: Introduce your scenario and data set including the variables provided.
  • Slide 3: Construct a scatterplot of the two variables provided in the spreadsheet. Include a description of what you see in the scatterplot.
  • Slide 4: Find the value of the linear correlation coefficient r and the critical value of r using α = 0.05. Include an explanation on how you found those values.
  • Slide 5: Determine whether there is sufficient evidence to support the claim of a linear correlation between the magnitudes and the depths from the earthquakes. Explain.
  • Slide 6: Find the regression equation. Let the predictor (x) variable be the magnitude. Identify the slope and the y-intercept within your regression equation.
  • Slide 7: Is the equation a good model? Explain. What would be the best predicted depth of an earthquake with a magnitude of 2.0? Include the correct units.
  • Slide 8: Conclude by recapping your ideas by summarizing the information presented in context of the scenario.

Along with your PowerPoint presentation, you should include your Excel document which shows all calculations.

  • attachment

    Deliverable_06_Data_ma.xlsx

Numerical analysis homework help

Deliverable 6 – Analysis with Correlation and Regression

Competency

Determine and interpret the linear correlation coefficient, and use linear regression to find a best fit line for a scatter plot of the data and make predictions.

Scenario

According to the U.S. Geological Survey (USGS), the probability of a magnitude 6.7 or greater earthquake in the Greater Bay Area is 63%, about 2 out of 3, in the next 30 years. In April 2008, scientists and engineers released a new earthquake forecast for the State of California called the Uniform California Earthquake Rupture Forecast (UCERF).
As a junior analyst at the USGS, you are tasked to determine whether there is sufficient evidence to support the claim of a linear correlation between the magnitudes and depths from the earthquakes. Your deliverables will be a PowerPoint presentation you will create summarizing your findings and an excel document to show your work.
Concepts Being Studied

  • Correlation and regression
  • Creating scatterplots
  • Constructing and interpreting a Hypothesis Test for Correlation using r as the test statistic

You are given a spreadsheet Click for more options
that contains the following information:

  • Magnitude measured on the Richter scale
  • Depth in km

Using the spreadsheet, you will answer the problems below in a PowerPoint presentation.

What to Submit

The PowerPoint presentation should answer and explain the following questions based on the spreadsheet provided above.

  • Slide 1: Title slide
  • Slide 2: Introduce your scenario and data set including the variables provided.
  • Slide 3: Construct a scatterplot of the two variables provided in the spreadsheet. Include a description of what you see in the scatterplot.
  • Slide 4: Find the value of the linear correlation coefficient r and the critical value of r using α = 0.05. Include an explanation on how you found those values.
  • Slide 5: Determine whether there is sufficient evidence to support the claim of a linear correlation between the magnitudes and the depths from the earthquakes. Explain.
  • Slide 6: Find the regression equation. Let the predictor (x) variable be the magnitude. Identify the slope and the y-intercept within your regression equation.
  • Slide 7: Is the equation a good model? Explain. What would be the best predicted depth of an earthquake with a magnitude of 2.0? Include the correct units.
  • Slide 8: Conclude by recapping your ideas by summarizing the information presented in context of the scenario.

Along with your PowerPoint presentation, you should include your Excel document which shows all calculations.

  • attachment

    Deliverable_06_Data_ma.xlsx

Numerical analysis homework help

Deliverable 3 – Confidence Intervals

Competency

Given a real-life application, develop a confidence interval for a population parameter and its interpretation.

Instructions

Scenario (information repeated for deliverable 01, 03, and 04)
A major client of your company is interested in the salary distributions of jobs in the state of Minnesota that range from $30,000 to $200,000 per year. As a Business Analyst, your boss asks you to research and analyze the salary distributions. You are given a spreadsheet Click for more options
that contains the following information:

  • A listing of the jobs by title
  • The salary (in dollars) for each job

You have previously explained some of the basic statistics to your client already, and he really liked your work. Now he wants you to analyze the confidence intervals.
Background information on the Data
The data set in the spreadsheet consists of 364 records that you will be analyzing from the Bureau of Labor Statistics. The data set contains a listing of several jobs titles with yearly salaries ranging from approximately $30,000 to $200,000 for the state of Minnesota.

What to Submit

Your boss wants you to submit the spreadsheet with the completed calculations. Your research and analysis should be present within the answers provided on the worksheet. Click for more options

  • attachment

    Deliverable_03_Questions.docx
  • attachment

    Deliverable_Spreadsheet_Del1_Del3_Del41.xlsx

Statistics homework help

Please follow Rubric in addition to answer the question on the word doc.
  • attachment

    Week4ProjectRubric.pdf
  • attachment

    STAT3001Week4Project_2_.docx

Statistics homework help

Instructions

You are currently working at NCLEX Memorial Hospital in the Infectious Diseases Unit. Over the past few days, you have noticed an increase in patients admitted with a particular infectious disease. You believe that the ages of these patients play a critical role in the method used to treat the patients. You decide to speak to your manager, and together you work to use statistical analysis to look more closely at the ages of these patients.
You do some research and put together a spreadsheet Click for more options
of the data that contains the following information:

  • Client number
  • Infection disease status
  • Age of the patient

You are to put together a PowerPoint presentation that explains the analysis of your findings which you will submit to your manager. The presentation should contain all components of your findings. For review, the components of the report should include:

  1. Brief overview of the scenario and variables in the data set
  2. Discussion, calculation, and interpretation of the mean, median, mode, range, standard deviation, and variance
  3. Discussion, construction, and interpretation of the 95% confidence interval
  4. Explanation of the full hypothesis test
  5. Conclusion

The calculations should be performed in your spreadsheet that you will also submit to your manager. You can find additional information on what to add to your PowerPoint presentation in this Word document Click for more options
. Use the questions in the worksheet as your guide for the contents of your presentation.
For your final deliverable, submit your PowerPoint presentation and the Excel workbook showing your work. Do not submit your Word document.

  • attachment

    Deliverable_07_Data.xlsx
  • attachment

    Deliverable_07_Questions_ma.docx