GitHub - MattNolanLab/analysis_pipelines: A description of the major analysis pipeline's used in the Nolan Lab. This repo will describe the general principles behind the pipelines and link to implementations and experimental details

Overview

This is a README describing the major analysis pipelines used in the Nolan Lab at the University of Edinburgh.

The main pipelines are designed to do "primary" data analysis, defined as transforming raw data into derived data used for further scientific data analysis. For example, these pipelines turn raw electrophysiological data into spike trains, and videos into behavioural time series. The derived data is then in a simple format which should make analysis by individual researchers easy.

Here is an example of a pipeline:

Raw data (red), is transformed in derived data (teal) by scripts (green). We try to stick to some simple rules to keep things easy: one script should produce one piece of output; each script is accessible by going to github.com/{script_name}; it should be possible to run each script locally on your computer. Each experimenter gets to decide how their input and output files are named and organised, but we try to follow the NeuroBluePrint format. The file naming and organisation must be documented in each experiment's documentation.

Experiments

The pipelines have been used on experimental data obtained in the Lab. Each experiment is bespoke, and has its own peculiarities. As such, we keep separate documentation for each experiment although there is much overlap between them. The experiments with documentation are (note: you might need to be a member of the NolanLab GitHub organisation to view these):

These pages should contain all the information needed to understand the data obtained by each experiment, and run the pipeline. The pipeline consists of several scripts. The scripts are stored in GitHub repositories (repos).

Repositories for pipeline steps

We keep a repo at https://github.com/MattNolanLab/ for each of the major steps in the pipeline. The most important ones are:

Spike sorting and ephys quality control: https://github.com/MattNolanLab/nolanlab-ephys
Using DeepLabCut: https://github.com/MattNolanLab/nolanlab-dlc

These are simple "template" repos. They contain the "minimal viable code". Your experiment might require adjusting them. Instead of directly editing them, you should create your own fork of the repo. To do this, go to the original repo (e.g. https://github.com/MattNolanLab/nolanlab-ephys) then click the arrow next to "Fork" at the top, then "Create a new fork". This will open a new page. You can then adjust some settings before clicking "Create fork". There is now a copy of the repo on your personal GitHub page (e.g. Chris' one is at https://github.com/chrishalcrow/nolanlab-ephys/). Now clone this repo (git clone https://github.com/yourusername/nolanlab-ephys/). You can now do anything you'd like to this repo without affecting the original one.

This system is designed so that there is a simple working base to start from (e.g. the MattNolanLab/nolanlab-ephys has a functional spike sorting pipeline in it) but each researcher can easily customize the pipeline if needed. If this is all a bit overwhelming: don't worry. There is documentation, advice and instructions on each repo. Go have a look!

Help!! Git, GitHub, Python, uv, EDDIE...

All analysis pipelines are version controlled using Git and hosted on GitHub. They are mostly written in Python and package management is organised to work well with uv (though you can use venv or conda if desired) and work on the Edinburgh EDDIE supercomputer. Here are some general resources to help with these tools:

What is Git?
Making a project on GitHub
Zen of Python
Intro to Python
uv on EDDIE

And we maintain some helper packages:

Helpers for the EDDIE supercomputer: eddie-helper Loading data easily: loadi

We use many open source packages, primarily:

Many thanks to those who maintain these packages!

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
LICENSE		LICENSE
README.md		README.md
wolf_pipeline.png		wolf_pipeline.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Experiments

Repositories for pipeline steps

Help!! Git, GitHub, Python, uv, EDDIE...

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Overview

Experiments

Repositories for pipeline steps

Help!! Git, GitHub, Python, uv, EDDIE...

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages