Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update to v2.0.0 #70

Merged
merged 1 commit into from
Oct 13, 2023
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
23 changes: 21 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
# Single Cell Expression Atlas database loading module (v1.0.0)
# Single Cell Expression Atlas database loading module (v2.0.0)

A [Single Cell Expression Atlas](https://www.ebi.ac.uk/gxa/sc) module for loading experiments to a Postgres 11
database. Release v0.4.0 was used for [the October 2022 data release of Single Cell Expression
Expand Down Expand Up @@ -63,7 +63,7 @@ export dbConnection=...
delete_db_scxa_dimred.sh
```

## `scxa_cell_group` and `scxa_cell_group_membership` table
## `scxa_cell_group` and `scxa_cell_group_membership` tables

### Load data
Run `bin/load_db_scxa_cell_clusters.sh`. It requires the following environment variables:
Expand Down Expand Up @@ -117,6 +117,25 @@ export dbConnection=...
delete_db_scxa_marker_genes.sh
```

## `exp_design` and `exp_design_column` tables

### Load data
Run `bin/load_exp_design.sh`. It requires the following environment variables:

| Variable name | Description |
|-----------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| `CONDENSED_SDRF_FILE` | Path of the condensed SDRF file of the experiment being loaded |
| `SDRF_FILE` | Path of the SDRF file of the experiment being loaded |
ke4 marked this conversation as resolved.
Show resolved Hide resolved
| `dbConnection` | A Postgres connection string in the form `postgresql://{user}:{password}@{host:port}/{databaseName}` pointing to a Postgres 11 server where the expected `scxa_cell_group_marker_genes` and `scxa_cell_group_marker_gene_stats` tables exist |

### Delete data
Currently, there’s no script to delete data from these tables. You can do it manually with the following SQL statements:

```sql
DELETE FROM exp_design WHERE exp_design_column_id IN (SELECT id FROM exp_design_column WHERE experiment_accession='E-FOO-123');
DELETE FROM exp_design_column WHERE experiment_accession='E-FOO-123';
```

## Post-loading a batch of experiments
Once a number of experiments have been loaded, tables should be re-indexed:

Expand Down
Loading