Michigan State University Center for Statistical Training and Consulting

June 6, 2023

Initial data analysis in the example of Pokémon species

Noon – 1:00 pm
online

CSTAT Data visualization seminar series

Dr. Lara Lusa, Natural Sciences and Information Technologies, University of Primorska, and Institute for Biostatistics and Medical Informatics, University of Ljubljana, Slovenia

Initial Data Analysis (IDA) consists of all steps performed on the data of a study between the end of data collection and the start of the statistical analyses that address the research questions. For data analysts, the value of an effective IDA strategy lies in ensuring that the data are of sufficient quality and that the model assumptions made in the analysis strategy are satisfied and adequately documented, and in supporting decisions about the statistical analyses. Here we focus on the data screening step of IDA, where data properties are examined and effective visualizations are a fundamental tool. The objective of our work is to present recommendations on how to implement an IDA plan, how to create visualizations that are effective for IDA, and how to make use of the IDA findings.

We present tutorial examples on how to conduct IDA data screening based on Pokémon data. More than 1000 different Pokémon species exist, which can be grouped into evolution chains. Several statistics and other information describing each species are available, including weight, height, and gender proportion, along with numerous statistics that describe the Pokémon’s ability in battle.

We define two research questions: (i) what are the predictors of a Pokémon’s height? (ii) some Pokémon species have unknown gender; what are the predictors of unknown gender? We present a brief statistical analysis plan (SAP) and develop an IDA data screening plan for the two research questions. We then present the IDA report, which is implemented in the R language using a reproducible markup document that includes numerous visualizations. We discuss the interpretation of the IDA results and their consequences, which indicate possible changes to the suggested SAP. We end by briefly discussing the use of IDA in the context of high-dimensional data, where the number of variables is extremely large.



Center for Statistical Training and Consulting

Giltner Hall
293 Farm Lane Room 100
East Lansing, MI 48824
517-353-9288
