JuliaStats Projects – Summer of Code

JuliaStats is an organization dedicated to providing high-quality packages for statistics in Julia.

Panel data analysis

Implement panel analysis models and estimators in Julia.

Difficulty. Moderate. Duration. 350 hours

Description

Panel data is an important kind of statistical data that deals with observations of multiple units across time. Common examples of panel data include economic statistics (where it is common to observe figures for several countries over time). This combination of longitudinal and cross-sectional data can be powerful for extracting causal structure from data.

Mentors. Nils Gudat, José Bayoán Santiago Calderón, Carlos Parada

Prerequisites

Your contribution

Participants will:

References

Distributions.jl Expansion

Distributions.jl is a package providing basic probability distributions and associated functions.

Difficulty. Easy-Medium. Duration. 175-350 hours

Prerequisites

Your contribution

Possible improvements to Distributions.jl include:

HypothesisTesting.jl Expansion

HypothesisTesting.jl is a package that implements a range of hypothesis tests.

Difficulty. Medium. Duration. 350 hours

Mentors. Sourish Das, Mousum Dutta

Prerequisites

Your contribution

Improvements to Distributions.jl include:

References

CRRao.jl

Implement consistent APIs for statistical modeling in Julia.

Difficulty. Medium. Duration. 350 hours

Description

Currently, the Julia statistics ecosystem is quite fragmented. There is value in having a consistent API for a wide variety of statistical models. The CRRao.jl package offers this design.

Mentors. Sourish Das, Ayush Patnaik

Prerequisites

Your contribution

Participants will:

JuliaStats Improvements

General improvements to JuliaStats packages, depending on the interests of participants.

Difficulty. Easy-Hard. Duration. 175-350 hours.

Description

JuliaStats provides many of the most popular packages in Julia, including:

All of these packages are critically important to the Julia statistics community, and all could be improved.

Mentors. Mousum Dutta, Ayush Patnaik, Carlos Parada

Prerequisites

Your contribution

Participants will:

Survey.jl

This package is used to study complex survey data. Examples of real-world surveys include official government surveys in areas like economics, health and agriculture; financial and commercial surveys. Social and behavioural scientists like political scientists, sociologists, psychologists, biologists and macroeconomists also analyse surveys in academic and theoretical settings. The prevalence of "big" survey datasets has exploded with the ease of administering surveys online. The project aims to use performance enhancements of Julia to create a fast package for modern "large" surveys.

Difficulty. Easy-Hard. Duration. 175-350 hours

Mentors. Ayush Patnaik

Prerequisites

Your contribution

The project can be tailored around the background and interests of participants and depending on ability, several standalone mini-projects can be created. Participants can potentially work on:

References