Long Intergenic Non-coding RNAs (lincRNAs) are the largest class of long non-coding RNAs in eukaryotes, originating from the genome's intergenic regions. A ∼4 kb long lincRNA-p21 is derived from a transcription unit next to the p21/Cdkn1a gene locus. LincRNA-p21 plays regulatory roles in p53-dependent transcriptional and translational repression through its physical association with proteins such as hnRNPK and HuR. It is also involved in the aberrant gene expression in different cancers. In this study, we have carried out a bioinformatics-based gene analysis and annotation of lincRNA-p21 to show that it is highly conserved in primates and identified two conserved domains in its sequence at the 5' and 3' terminal regions. hnRNPK has previously been shown to interact specifically with the 5' conserved region of lincRNA-p21. hnRNPK is known to bind preferentially to the pyrimidine-rich (poly C) nucleotide sequences in RNAs. Interestingly, we observed a single occurrence of a cytosine-rich patch (C-patch) consisting of a CUCCCGC sequence in the 5' conserved region of human lincRNA-p21, making it a putative hnRNPK binding motif. Using NMR and ITC experiments, we showed that the single-stranded C-patch containing RNA sequence motif interacts specifically with the KH3 domain of hnRNPK.
Keywords: Bioinformatics; ITC; NMR CSP; Sequence conservation; hnRNPK; lincRNA-p21.
© 2023 The Authors.