Minimum Data Requirements
IMEx partner databases request the following minimum information about a molecular interaction experiment to be provided with each data deposition. Data should be MIMIx compatible. Submitters should supply the following information. 1. Manuscript information
- Contact email
- Publication title
- First author
- Publication identifier
2. Experiment Experimental procedures should be described in detail, preferably using the controlled vocabulary terms produced by the Human Proteome Organization Proteomics Standards Initiative (HUPO-PSI). These are available on the PSI web site or the Ontology Lookup Service. To include:
- Host system (NCBI taxid and details of tissue, cell line…)
- Interaction detection method
- Participant detection method
- Participant – to be identified by either molecule identifiers from a public domain database or a sequence accompanied by species of origin (NCBI taxid).
- Protein accession number from UniProtKB, RefSeq, or a model organism database.
- Gene identifiers from Ensembl, Entrez Gene, a model organism database or nucleotide accession number from DDBJ, EMBL, GenBank.
- Small molecules should be described with full chemical names of accession number from ChEBI or PubChem.
- Biological role of each participant (e.g. enzyme)
- Experimental role of each participant (e.g. bait)
4. Confidence If applicable, a confidence value attributed to the interaction.
Data Submission by e-mail
Several databases will accept submissions in free text format, for example a manuscript prepared for submission. Either send an e-mail to Contact us alerting us to your data and we will decide on the most apporpirate database and help with data preparation of contact one of the below directly.
- IntAct (www.ebi.ac.uk/intact), please send to firstname.lastname@example.org
- DIP (http://dip.doe-mbi.ucla.edu), please send to email@example.com
- MINT (http://mint.bio.uniroma2.it/mint), please send to firstname.lastname@example.org
Data Submission by PSI-MI Interchange Format
Producers of large datasets may wish to submit their data in the form of XML files, following the PSI-MI standard for molecular interactions. The current version is 2.5. Files should then be sent to any of the participating databases listed above. It may be useful to view the following example of a validated file containing minimal information but we would encourage the addition of any further experimental detail which is available.
Data Submission by Excel Sheet
For biologists who wish to submit data in a structured form but do not wish to produce an XML file, a Microsoft Excel© submission sheet has been prepared. Please note:
- This spreadsheet will not operate correctly on the Apple Mac version of Microsoft Excel©
- This spreadsheet makes use of Visual Basic macros to allow it to connect to the Ontology Lookup Service to retrieve the latest PSI-MI ontology terms and to allow it to select the correct ontology term accessions, this making it easy for you to select the correct terms to annotate your data.
For your security, we have signed the VBA in this spreadsheet with a digital certificate issued by Thawte™. When you first open the spreadsheet you will be presented with one of the following two dialogue boxes, depending on the current security setting in your version of Excel.
Macros will then be automatically enabled for this spreadsheet when you open it in the future. Once you have accepted the certificate, the spreadsheet will proceed to the update of its controlled vocabularies. You will be prompted the following message box beforehand:
If the 'Update PSI-MI CV Terms' dialogue box does not appear, you will not be able to enter data on to the spreadsheet as the necessary macros are disabled. If this occurs, or if you encounter any other problems, please report to . Files should then be sent to any of the participating databases listed above. It may be useful to view the following example of a validated file containing minimal information but we would encourage the addition of any further experimental detail which is available.
Data Submission by Web Form
For users who prefer to use a structured web form to add data, such a tool has been built which allows the user to move successively through the submission process, listing the experiments, interactors and interactions as described above. Files should then be sent to any of the participating databases listed above.
Data Submission - a Worked Example
To aid in data deposition the following publication is represented in the multiple forms offered for IMEx data deposition, to provide a reference example: Croze, E., Usacheva, A., Asarnow, D, Minshall, R.D., Perez, H.D., Colamonici O PubMed ID: 11046044 Receptor for activated C-kinase (RACK-1), a WD motif-containing protein, specifically associates with the human type I IFN receptor.
J Immunol. 2000 Nov 1;165(9):5127-32. The data is available in:
- Microsoft Excel© submission sheet
- PSI-MI XML version 2.5
- HTML view. A similar view can be automatically generated and provided to the data submitter.
Full annotation of the data in this paper can be viewed in the IntAct database at www.ebi.ac.uk/intact using Accession Nos.
All IMEx partners agree to release submitted data only when accepted by a peer reviewed journal for publication and not before the date of publication of the corresponding paper. The data will be made public by the database to which it was submitted and released as an IMEx record which may be included in the datasets of any or all IMEx databases.