Investigation of the pattern of hepatitis C virus sequence diversity in different geographical regions: implications for virus classification. The International HCV Collaborative Study Group

J Gen Virol. 1995 Oct;76 (Pt 10):2493-507. doi: 10.1099/0022-1317-76-10-2493.


Genotypes of hepatitis C virus (HCV) present within 104 samples from HCV-infected individuals from Africa, the Middle East, the Indian subcontinent and South-East Asia were identified by sequence comparisons in the core and NS-5 regions. Relatively short sequences (such as the 222 bp fragment of NS-5) provided effective discrimination of types, subtypes and isolates, and produced relationships between genotypes equivalent to those found upon comparison of longer sequences of NS-5, of the core region, and of the limited number of complete genomic sequences currently available. Measurement of evolutionary distances in the core and NS-5 regions allowed 79 of the 104 samples to be identified as examples of known genotypes, while 17 of the remainder could be provisionally classified as new subtypes of types 1 (Nigeria), 2 (Gambia), 3 (India, Pakistan and Bangladesh) and 4 (Middle East) on the basis of sequence comparison in core and NS-5 (n = 9) or core alone (n = 8). The remaining sequences from Thailand made up two groups showing no close similarity to any of the six major genotypes classified to date, although one corresponded to an as yet unclassified variant of HCV also found in Thailand. However, phylogenetic analysis of the core and NS-5 regions indicated a distant relationship of these sequences with variants found in Vietnam and with type 6a, and collectively they formed a single, diverse phylogenetic group. Great diversity within a single genotype was also found amongst type 3 sequences in the Indian subcontinent, amongst type 4 variants in Central Africa and the Middle East, and amongst type 1 variants in Nigeria. These findings may provide clues for understanding the origins and mechanisms of transmission of HCV.
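
The abstract does not say which evolutionary distance measure was used. As an illustration only, the sketch below shows one common way of turning an aligned pair of nucleotide sequences into a distance: the proportion of differing sites (p-distance), followed by the Jukes-Cantor correction. The function names, the toy sequences and the reference labels `ref_A` and `ref_B` are hypothetical and are not taken from the study.

```python
import math


def p_distance(a, b):
    """Proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap or ambiguity code are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [
        (x, y)
        for x, y in zip(a.upper(), b.upper())
        if x in "ACGT" and y in "ACGT"
    ]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)


def jukes_cantor(p):
    """Jukes-Cantor corrected distance, d = -(3/4) * ln(1 - 4p/3)."""
    if p >= 0.75:
        # The correction is undefined at or beyond saturation.
        return math.inf
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


# Toy 20-nt sequences (made up for illustration, not real HCV data).
query = "ACGTACGTACGTACGTACGT"
references = {
    "ref_A": "ACGTACGTACGTACGTACGA",  # differs from query at 1 site
    "ref_B": "ACGAACGTTCGTACCTACGA",  # differs from query at 4 sites
}

# Corrected distance from the query to each hypothetical reference.
distances = {
    name: jukes_cantor(p_distance(query, seq))
    for name, seq in references.items()
}

# The reference with the smallest distance is the closest match.
nearest = min(distances, key=distances.get)
print(nearest, round(distances[nearest], 4))
```

In practice, distances of this kind are compared against ranges observed between known types, subtypes and isolates, and a sequence falling outside all known ranges would be a candidate new variant. The thresholds themselves are not given in the abstract and are deliberately left out here.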

MeSH terms

  • Africa
  • Asia
  • Base Sequence / genetics
  • Europe
  • Genes, Viral / genetics
  • Genetic Variation / genetics*
  • Genome, Viral*
  • Genotype
  • Hepacivirus / classification*
  • Hepacivirus / genetics*
  • Humans
  • Molecular Sequence Data
  • Phylogeny
  • RNA, Viral / blood
  • RNA, Viral / genetics*
  • Sequence Analysis, DNA
  • Terminology as Topic
  • Viral Core Proteins / genetics
  • Viral Nonstructural Proteins / genetics


Substances

  • RNA, Viral
  • Viral Core Proteins
  • Viral Nonstructural Proteins
  • nucleocapsid protein, Hepatitis C virus
  • NS-5 protein, hepatitis C virus

Associated data

  • GENBANK/D00944
  • GENBANK/D10078
  • GENBANK/D10079
  • GENBANK/D10080
  • GENBANK/D10081
  • GENBANK/D10134
  • GENBANK/D10641
  • GENBANK/D10642
  • GENBANK/D10643
  • GENBANK/D10644
  • GENBANK/D10645
  • GENBANK/D10646
  • GENBANK/D10647
  • GENBANK/D10648
  • GENBANK/D10649
  • GENBANK/D10650
  • GENBANK/D10749
  • GENBANK/D10750
  • GENBANK/D10934
  • GENBANK/D10988
  • GENBANK/D13558
  • GENBANK/D14853
  • GENBANK/D90208
  • GENBANK/L02836
  • GENBANK/M62321
  • GENBANK/M62384
  • GENBANK/M62583
  • GENBANK/M67463
  • GENBANK/M84754
  • GENBANK/X61956