Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Making the parent-child relationship fieldname configurable #78

Open
tdido opened this issue Jun 28, 2021 · 6 comments · May be fixed by #83
Open

Making the parent-child relationship fieldname configurable #78

tdido opened this issue Jun 28, 2021 · 6 comments · May be fixed by #83

Comments

@tdido
Copy link

tdido commented Jun 28, 2021

Hello.

Some GFF3 files use the "Derives_from" field instead of the "Parent" field to establish a parent-child relationship (see an example from miRBase: ftp://mirbase.org/pub/mirbase/CURRENT/genomes/ath.gff3).

Would you consider a pull request to add an argument that makes this configurable?

@agshumate
Copy link
Owner

Hi,
Apologies for the late response. I am certainly happy to consider any pull request that allows liftoff to handle a wider ranger of GFF3s.
-Alaina

tdido added a commit to tdido/Liftoff that referenced this issue Jul 23, 2021
This is usefult for example in the case of miRBase
GFF files, where "Derives_from" is used instead of
"Parent".

closes agshumate#78
@tdido tdido linked a pull request Jul 23, 2021 that will close this issue
@Juke34
Copy link

Juke34 commented Sep 2, 2021

This can be dangerous, Derives_from do not have the same meaning as "Parent". We should inform mirbase.org to be more compliant with the official specification: https://github.com/The-Sequence-Ontology/Specifications/blob/master/gff3.md
Personnaly I'm not sure this is something that has to be done within LiftOff. Users should provide proper input files (And fix them if needed).

In this pipeline https://github.com/guigolab/FA-nf they added AGAT as GFF/GTF sanitizer, LiftOff is not a pipeline, but we could provide a guideline how to sanitize GFF/GTF files before to use LiftOff.

@tdido
Copy link
Author

tdido commented Sep 2, 2021

Hi, thanks for your reply.

This is of course debatable, but I don't necessarily see it as dangerous, since the default behaviour is still using Parent. If the user is using arguments one could assume she/he knows what she/he is doing.

In particular, I guess miRBase considers the mature miRNA transcripts to be somewhat akin to Polycistronic transcripts as described in the document you linked, since there are different miRNA hairpins that generate the exact same mature sequence.

In any case, thank you for your feedback and feel free to close the request if you think it's not worth including.

@agshumate
Copy link
Owner

Hi,
I do agree that it is not dangerous since the default is still Parent so changing it from that is sort of 'at your own risk'. I also really like AGAT as a way to sanitize GFF/GTFs. I will add that to the readme and merge in the pull request unless there are further concerns about that.

@Juke34
Copy link

Juke34 commented Sep 3, 2021

Right, dangerous was a bit strong ^^. I agree a novice will not play with that parameter.
I mainly wanted to warn that trying to be compliant with all sort of GFF/GTF types that can be met might be a never ending work.

@jolbi
Copy link

jolbi commented Aug 28, 2023

Hi,

Has this feature been implemented? I have miRNA gff like that. How can I tell liftoff to use "Derives_from" field?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants