Lab manual

Research

Project life cycle

Initializing a project

  1. Register your project in the lab's project register:
    1. Go to https://cemmat.sharepoint.com/sites/rendeirolab and find the 'Lab project register.xlsx' or directly here.
    2. Start a new row for your project and increment the Project ID by one.
    3. Choose an intuitive name for the project, avoiding adding personal information e.g. collaborator names.
  2. Create a project directory structure from the lab's template: cookiecutter gh:rendeirolab/_project_template
  3. Create a git repository on GitHub (https://github.com/rendeirolab) with the same name as the project, make a first commit and push to Github.
  4. Create a directory for the project in the CeMM cluster at: /research/groups/lab_rendeiro/projects/
    1. Create a directory inside called data to store raw data.
  5. Create a directory for the project in the CeMM cluster at: /nobackup/groups/lab_rendeiro/projects/
    1. Create a soft link between /research/.../data and /nobackup/.../data
  6. Create a cemm_metadata.json file in /research/groups/lab_rendeiro/projects/$PROJECT/
  7. Create a cemm_metadata.json file in /nobackup/groups/lab_rendeiro/projects/$PROJECT/

You can find various metadata JSON templates at /research/groups/lab_rendeiro/projects/_templates/.

Make sure to validate your JSON files, e.g.:

wget --no-check-certificate https://metadata.int.cemm.at/latest/cemm_metadata_schema.json
uvx check-jsonschema --schemafile cemm_metadata_schema.json cemm_metadata.json 

Developing a project

See the Project Planning section for conceptual details on how to design and develop a project.

From a practical standpoint, also refeer to the lab's Project Template for instructions on how to set up a project repository, and how to structure your project files.

Project management

  • Make sure to maintain the cemm_metadata.json files in line with the existing data on disk.
  • Version control: Use git for all code and analysis scripts. Commit frequently with descriptive messages.
  • Naming conventions: Adhere to consistent and descriptive naming conventions for files and directories to ensure clarity and ease of access.
  • File organization: Maintain a clear and logical directory structure for each project, separating raw data, processed data, scripts, results, and documentation.
  • Data backup: Backup only original data, or timestamped versions of processed data. Use the CeMM cluster's /research/groups/lab_rendeiro/projects/$PROJECT directory for this purpose.

Reporting research

Figures are the core of communicating reserach:

  • Figures are crucial for communicating research. We encourage lab members to:
  • Conceptualize figures early: Sketch potential figures during project development to visualize your story and refine experimental design. This "Figure Zero" mentality focuses work on key data that supports your narrative.
  • Assemble figures often: Don't wait until the end. Assemble plots into draft figures iteratively as data comes in. This helps identify gaps, maintain quality, and keeps publication as the ultimate goal.
  • Leverage vector graphics: Figures created as vector graphics (SVG, PDF) are highly versatile. They scale without resolution loss and can be reused efficiently across publications, presentations, grants, and posters, saving significant time.

PowerPoint, often due to misuse, can be detrimental to effective communication by:

  • Enforcing rigid structure: Its linear format can oversimplify complex ideas, losing nuance.
  • Encouraging style over substance: Time is often spent on aesthetics rather than refining the core message.
  • "Death by PowerPoint": Poorly constructed slides can bore and disengage the audience.

By focusing on making figures proactively and using them to communicate (instead of PowerPoint), you'll develop your skills of visual story telling, enable you to tailor your analysis more directly to the a final figure plant and accelerate the path to publication.

Tools and technologies of standard use within the lab

File types of digital data

Below are the preferred types of technology to be used in the lab to ensure consistency and efficiency:

  • Documentation: Markdown (for manuals, READMEs, lab notebooks)
  • Programming language: Python (primary language for data analysis and scripting)
  • Image files: OME-TIFF (for high-resolution images with extensive metadata support)
  • Tabular form files: CSV (or csv.gz), Parquet (for efficient storage and retrieval of tabular data)
  • Plots: SVG, PDF (for vector graphics that scale without pixelation)

Engaging in new projects

Be receptive to input and suggestions from others on collaborating and engaging in new projects.

However, a careful balance between finishing existing projects and starting new ones is crucial.

Consult with Andre before committing to new projects, especially if they involve significant time investment or resources.

Meta-science


Edit this page on GitHub