ABSTRACT
X-linked agammaglobulinemia (XLA) is an immunodeficiency caused by mutations
in the gene coding for Bruton's agammaglobulinemia tyrosine kinase (BTK). A
database (BTKbase) of BTK mutations has been compiled and the recent update
lists 368 entries from 318 unrelated families showing 228 unique molecular
events. In addition to mutations the database lists also some polymorphisms and
site-directed mutations. Each patient is given a unique patient identity number
(PIN). Information is provided regarding the phenotype including symptoms.
Mutations in all the five domains of BTK have been noticed to cause the
disease, the most common event being missense mutations. The mutations appear
almost uniformly throughout the molecule and frequently affect CpG sites
forming arginine residues. These hot spots have generally pyrimidines 5
'
and purines 3
' to the mutated cytosine. A decreased frequency of missense mutations was found
in the TH, SH3 and the upper lobe of the kinase domain. The putative structural
implications of all the missense mutations are given in the database showing
228 unique molecular events, including a novel missense mutation causing an
R28C substitution as previously seen in the Xid mouse.
X-linked agammaglobulinemia (XLA) is a hereditary immunodeficiency caused by
mutations in the gene coding for Bruton's agammaglobulinemia tyrosine kinase
(BTK) (
1
,
2
). Patients with XLA have a decreased number of mature B cells in their
peripheral blood and show a lack of all immunoglobulin isotypes causing
susceptibility to severe bacterial infections (
3
). The
BTK
gene is mapped to the midportion of the long arm of X-chromosome at Xq21.3-Xq22 (
4
-
7
). The 37.5 kb gene contains 19 exons, 18 of which code for a 77 kDa protein (
8
-
11
). BTK is expressed in all hematopoietic lineages except for T lymphocytes and
plasma cells (
12
). The murine gene for
Btk
has also been cloned and sequenced (
10
).
BTK is crucial for signaling in B cells (
13
,
14
). It belongs to a group of related cytoplasmic protein tyrosine kinases (PTKs)
formed by TEC (
15
) ITK/TSK/EMT (
16
,
17
) and BMX (
18
), known as the Tec family. The Tec family proteins consist of five distinct
structural domains (
1
,
2
,
19
,
20
), which are from the N-terminus, pleckstrin homology (PH) domain of ~120 amino acids, Tec homology (TH) domain (~60-80 residues), Src homology 3 (SH3) domain of ~60 residues, SH2 domain (~100 amino acids) and the catalytic kinase
domain of ~280 residues. The BTK protein is 659 residues long. Mutations in all the
five domains have been noticed to cause XLA (
21
-
23
). The structural consequences of the mutations in all the domains except for TH
have been studied based on computer-aided molecular modeling (
21
-
29
).
PH domains have been suggested to be involved in membrane (
30
), heterotrimeric G protein (
31
,
32
) and protein kinase C (
33
) binding. It has been suggested based on the mutational and structural
information that the domain has a 2-fold function (
14
,
27
). The N-terminal portion seems to be involved in membrane binding whereas the C-terminal [alpha]-helix and its extension in the adjacent TH domain are
responsible for G
[beta][gamma]
binding. The three dimensional structures of pleckstrin and spectrin PH domain
in complex with phosphoinositides (
30
,
34
) have indicated the residues that are responsible for binding. Protein kinase C
has been proposed to bind to the N-terminal half of the BTK PH domain (
33
). The first PH domain mutation was found in the Xid (X-linked immunodeficiency) mouse (
35
,
36
).
The TH domain has two distinct features (
19
,
20
). The N-terminal Btk motif of 27 residues seems to be an extension of the PH
domain. About 30 residues following the PH domain have been shown to be
important for [beta][gamma] binding (
31
). The whole TH domain is typical only for the Tec family (
20
), but the region corresponding to the Btk motif is important also for other [beta][gamma] binding PH domains (
31
,
37
). Although the PH domain extensions generally do not share significant sequence
similarity, they have a stretch of basic residues in common (
31
). This region is crucial for G
[beta][gamma]
binding in [beta]ARK (
38
) having effect on affinity (
37
). The C-terminal region of the TH domain contains one or two proline rich regions
(PRRs) which have been shown to interact with the SH3 domains of Src family
PTKs FYN, LYN and HCK (
39
). In TEC the corresponding region is recognized by the LYN SH3 domain (
40
). The PRRs form presumably polyproline type II helices which are known to
interact with SH3 domains.
The SH2 and SH3 domains each recognize short peptide motifs bearing either
phosphotyrosine (pTyr) residue or polyprolines, respectively. These compact
functional modules appear widely in signalling molecules, but SH2 domains are
explicitly involved only in PTK signalling pathways (
41
). These domains link BTK to partner molecules. Amino acids surrounding the pTyr
can increase the affinity by three orders of magnitude. SH3 domains bind
peptides and proteins that have a left-handed polyproline type II helix. Since these helices are almost
symmetrical, the direction of binding depends on the residues surrounding the prolines (
41
).
The kinase domain is the only catalytic region in the Tec family kinases. A
conserved ATP binding site locates between two structural lobes. The upper
lobe, which is formed mainly of [beta]-strands, has turned relative to the lower [alpha]-helical lobe in the inactive form of the enzyme. All
the known kinases contain several highly conserved residues, which are involved
in substrate and cofactor binding as well as in some structurally crucial
sites.
To coordinate BTK mutation analysis an international study group was established
in 1994 (
21
). BTK mutation data, both published and directly submitted information, has
been collected into a database called BTKbase (
21
-
23
,
42
-
44
). The database contains information about the mutations and XLA patients. The
study group gives for each patient an individual patient identity number (PIN).
The PIN consists of the type of mutation and a running number indicating
mutations affecting the same amino acid or the same non-coding region. The PIN is given as soon as a mutation is available to the
study group. Scientists are encouraged to contact the study group before
publishing their mutation data to get the PINs. Data can be kept confidential
until published. A more detailed description of the formation of PINs and an
example of a coded entry is given in Vihinen
et al.
(
45
).
The database contains the following information for each patient (when
available). The first lines are used to provide the necessary data for the
identification of the entry and plain English descriptions of the mutation.
These are followed by reference lines and lines which characterise formally the
mutation. Last are the lines describing various parameters from the patient.
Other immunodeficiency mutation databases (
46
-
49
) follow the same guidelines, which facilitates development of common tools,
e.g. for submission and analysis of data.
Table 1
There are altogether 368 patients in the database with XLA mutations that are
scattered along the BTK gene (Fig.
1
). The patients represent 318 unrelated families. The proportion of unique
mutations is 72% (228 cases), which is reduced compared with the previous
report. The distribution of the mutations in the five structural domains is
approximately according to the length of the domains (Fig.
1
). Exonic mutations are distributed as follows: 123 families have missense
mutations, 66 nonsense mutations, 24 insertions and 57 deletions (Table
1
). In addition there are 49 intron mutations affecting splice sites. Three
double mutations and a single triple mutatation have been detected. Mutations
confined to the promoter region have not been found. The gene defect of nine
gross deletions have not been characterized in detail. The figures are
calculated from the total number of identified mutations i.e. all the
alterations in the families having multiple mutations are taken into account.
The continuously updated database is freely available via anonymous ftp at
csb.ki.se in the directory pub/btkbase. Use anonymous as username and your e-mail address as password. World Wide Web distribution is available at
BTKbase . Inquiries and new
mutation data can be sent to mauno.vihinen{at}helsinki.fi, preferably by using the
submission form at the Web pages.
This work was supported by Biocentrum Helsinki, the Swedish Medical Research
Council, the Swedish Cancer Society, the Åke Wiberg Foundation, and European BIOMED concerted action `PL1321'.
Exon
Amino acids
Missense
Nonsense
Deletion
Insertion
2
1-47
10/23/29
3/9/9
2/2/2
1/1/1
3
48-80
2/2/3
1/1/1
7/9/9
3/7/7
4
81-103
1/1/1
1/1/1
1/1/1
5
104-130
2/2/2
1/1/1
3/3/3
6
131-173
2/2/2
6/6/7
5/5/7
1/1/1
7
174-196
1/1/1
1/1/1
3/4/4
2/4/4
8
197-259
1/1/1
3/9/11
7/7/8
1/1/1
9
260-280
1/1/1
1/1/1
1/1/1
10
281-298
2/7/9
2/3/4
11
299-325
2/5/5
1/1/1
4/5/6
12
326-367
6/6/9
2/3/3
1/1/1
13
368-392
1/1/1
2/4/4
1/1/1
1/1/1
14
393-450
5/6/7
3/4/5
1/1/1
15
451-522
10/25/25
6/13/14
3/3/3
1/1/3
16
523-544
5/8/15
1/2/2
6/9/9
17
545-583
6/11/11
2/2/2
2/2/2
1/1/1
18
584-636
13/16/24
5/5/7
3/3/3
3/3/3
19
637-659
5/7/7
1/1/1
1/1/1
Total
73/123/151
41/66/74
50/57/61
18/24/26
REFERENCES
Return
