Data supporting a saturation mutagenesis assay for Tat-driven transcription with the GigaAssay

Data Brief. 2022 Sep 28:45:108641. doi: 10.1016/j.dib.2022.108641. eCollection 2022 Dec.

Abstract

The data in this article are associated with the research paper "GigaAssay - an adaptable high-throughput saturation mutagenesis assay" [1]. The raw data are sequence reads of HIV-1 Tat cDNA amplified from cellular genomic DNA in a new single-pot saturation mutagenesis assay designated the "GigaAssay". A bioinformatic pipeline and parameters used to analyze the data. Raw, processed, analyzed, and filtered data are reported. The data is processed to calculate the Tat-driven transcription activity for cells with each possible single amino acid substitution in Tat. This data can be reused to interpret Tat intermolecular interactions and HIV latency. This is one of the largest and most complete datasets regarding the impact of amino acid substitutions within a single protein on a molecular function.

Keywords: High-throughput assay; Intragenic epistasis; Loss of Function (LOF); Protein structure; Saturation mutagenesis; Tat; Transcription.