Keywords:
stroke - genetics - biobank
Palavras-chave:
acidente vascular cerebral - genética - biobancos
According to the Global Burden of Disease Study, in 2013, there were almost 25.7 million
stroke survivors (71% with ischemic stroke (IS)), 6.5 million deaths from stroke (51%
died from IS), 113 million disability-adjusted life years due to stroke (58% due to
IS) and 10.3 million new strokes (67% IS) worldwide[1]. The INTERSTROKE study defined the population-attributable risks for ischemic and
hemorrhagic strokes in 22 countries, concluding that 10 classical cardiovascular risk
factors are associated with 90% of the risk of stroke[2]. However, twins and familial aggregation studies suggest that the risk of stroke
has a substantial genetic component[3]. Although genetics, and more recently genomics, play an increasingly large role
in the practice of medicine, the daily clinical care of patients suffering from stroke
has not significantly been affected by the advances in these fields. Therefore, one
can assume that there is still much knowledge about stroke genetics and genomics needing
to be translated into clinical practice.
Currently, there are tremendous efforts to understand the genetic basis of both rare
and common cardiovascular and stroke disorders through strategies such as genome-wide
association studies and next-generation sequencing studies[4]. In this scenario, it has been pointed out that well-established stroke biobanks
with close collaboration between clinicians and geneticists are essential for supplying
the biological and clinical information required for these large-scale studies[4],[5],[6]. To address these challenges specifically, recent cooperative efforts have been
launched such as the Stroke Genetics Network and Bio-Repository of DNA in Stroke[7],[8].
As defined by Brazilian regulatory guidelines, a biobank represents an organized collection
of human biological material and associated data that are prospectively collected
and stored for research purposes, in accordance with predefined technical, ethical
and operational standards, under institutional responsibility and management. We present
the study protocol and current status of the Joinville Stroke Biobank (JSB), the first
Latin American DNA biobank of stroke.
METHODS
Participants and samples
The current biobank originated from two sequential initiatives to study epidemiological
aspects of stroke in recent years in Brazil[9]. The phenotypic data were extracted from the Joinville Stroke Registry, which is
a population-based registry that has been ongoing since 2005. In 2013, the registry
became supported by a municipal law[10]. The beginning of case and control data ascertainment, blood extraction and DNA
storage took place in 2010. These samples and data, which are still being collected
and processed in Joinville, constitute phase I of the JSB, which has no deadline for
completion.
Phase II was launched in 2015, when the JSB started to receive blood samples and phenotypic
data from four other Brazilian cities. This constitutes a task force of the Brazilian
Consortium of Stroke Research, which is sponsored by the Brazilian Ministry of Health
and the National Council for Scientific and Technological Development, to define stroke
incidence trends, case-fatality proportions and the prevalence of classic cardiovascular
risk factors according to the WHO Steps criteria[11] in different geopolitical scenarios than Joinville, which has a higher Human Development
Index than other cities. Data and samples will be collected until 2017.
Central site and contributing sites
In JSB phase I, inpatients from five hospitals and out-patient data (mild strokes)
from the town of Joinville (515,288 inhabitants, 2010 census) were included, whereas
JSB phase II covered the following cities: Sobral, northeast region (three hospitals;
147,135 inhabitants), Campo Grande, central region (eight hospitals; 774,202 inhabitants),
Sertãozinho, south-east region (two hospitals; 101,784 inhabitants) and Canoas, south
region (three hospitals; 323,827 inhabitants), whose geographic locations are shown
in the [Figure]. The central site, as defined by the International Stroke Genetics Consortium is
located in the facilities at the University of the Region of Joinville.
Figure Localization of cities, according to Brazilian mesoregions, corresponding to central
and contributing sites of the Joinville stroke biobank.
Inclusion and exclusion criteria
Since the beginning of both phases, data and blood samples have been collected from
each case and related controls. Patients included transient ischemic attacks, subarachnoid
hemorrhages, ischemic strokes and hemorrhagic strokes. We included first-ever or recurrent
patients, regardless of age and sex, residing in one of the five cities. All patients
had at least one cranial tomography. Stroke confirmation and diagnosis were performed
as previously reported[9]. In brief, the phenotype of each stroke patient includes demographic information,
social class (according to the Brazilian Criteria of Economic Classification based
on the National Household Sample Survey)[12], years of education, type of work (manual or not), family history of stroke, cardiovascular
risk factors, biochemical tests and functional status (modified Rankin scale)[13]. Furthermore, IS subtypes were classified according to the modified Trial of Org
10172 in Acute Stroke Treatment (TOAST) criteria and Causative Classification System
for Ischemic Stroke (CCS) criteria[14],[15]. Patients with IS of undetermined cause due to negative investigation or cryptogenic
stroke were classified according to the Embolic Strokes of Undetermined Source international
work group[16]. A neuroradiologist classified all hemorrhagic strokes as lobar and non-lobar subtypes.
Controls were matched by age and gender to patients, and individuals with a previous
history of stroke or blood ties to patients were excluded. In cases of a positive
family history of stroke, blood samples of all related patients and non-stroke relatives
were searched for and nominated as nuclear cases.
Phenotyping methods
[Table 1] shows the diagnostic examination and the categories of data retrieved by questionnaires.
After 2012, the CCS criteria for phenotyping IS patients were also employed. The neurologists
of all cities in the contributing sites were trained by stroke neurologists from the
Joinville Stroke Registry using the platform available on the CCS website[15]. After the TOAST diagnosis, the clinical history, physical examination findings
and the results of diagnostic work-up were reviewed for determining the CCS classification.
Ischemic stroke patients with incomplete diagnostic examination were classified as
a possible IS subtype or received an IS diagnosis as undetermined with incomplete
evaluation. All Joinville IS diagnoses were blinded for adjudication by two authors
(FIR and NLC).
Table 1
Demographic and diagnostic examination data available in the Joinville Stroke Biobank.
Category
|
Description
|
Cases
|
Controls
|
|
Date of birth/age
|
Y
|
Y
|
|
Gender
|
Y
|
Y
|
|
Skin color
|
Y
|
Y
|
Sociodemographic
|
City of residence
|
Y
|
Y
|
|
Address
|
Y
|
Y
|
|
Social status
|
Y
|
N
|
|
Education level
|
Y
|
N
|
|
First-ever event?
|
Y
|
N
|
|
Date of enrollment
|
Y
|
N
|
|
Hour of enrollment
|
Y
|
N
|
|
Creatinine
|
Y
|
N
|
|
Triglyceride
|
Y
|
N
|
|
Uric acid
|
Y
|
N
|
Clinical and biochemical data
|
Blood glucose
|
|
N
|
Cholesterol (hdl/ldl)
|
|
N
|
|
National Institutes of Health Stroke Scale (admission)
|
Y
|
N
|
|
Barthel Index
|
Y
|
N
|
|
Rankin Index
|
Y
|
N
|
|
Oxfordshire Community Stroke Project classifcation (Banford)
|
Y
|
N
|
|
TOAST classifcation
|
Y
|
N
|
|
CCS classifcation
|
Y
|
N
|
|
Transient ischemic attack
|
Y
|
Y
|
|
Hypertension
|
Y
|
Y
|
|
Diabetes mellitus
|
Y
|
Y
|
|
Dyslipidemia
|
Y
|
Y
|
|
Acute myocardial infarction
|
Y
|
Y
|
Premorbid risk factors
|
Congestive heart failure
|
Y
|
N
|
|
Atrial fbrillation
|
Y
|
N
|
|
Smoking
|
Y
|
Y
|
|
Alcohol consumption **
|
Y
|
N
|
|
Drug consumption
|
Y
|
N
|
|
Physical activity
|
Y
|
N
|
|
Brain CT
|
Y
|
N
|
|
Brain MRI
|
Y
|
N
|
|
Digital angiography
|
Y
|
N
|
|
Extracranial vessels evaluation
|
|
|
|
Ultrasound Doppler
|
Y
|
N
|
|
CT – angiography
|
Y
|
N
|
|
MR – angiography
|
Y
|
N
|
Diagnostic examination
|
Intracranial vessels evaluation
|
|
|
|
Transcranial Doppler
|
Y
|
N
|
|
CT – angiography
|
Y
|
N
|
|
MR – angiography
|
|
|
|
Embolic sources
|
Y
|
N
|
|
ECG
|
Y
|
N
|
|
Transthoracic echocardiography
|
Y
|
N
|
|
Transesophageal echocardiography
|
Y
|
N
|
|
24 h Holter
|
Y
|
N
|
Outcome measures
|
Rankin scale (30 days, 6 months, once a year up to 5 years)
|
Y
|
N
|
Death (cause, date) by death certificate
|
Y
|
N
|
TOAST: Trial of Org 10172 in Acute Stroke Treatment criteria16; CCS: Causative Classification
System criteria17. CT: computed tomography; MRI: magnetic resonance imaging. Y: yes
(available); N: no (not available).
*Self-declared skin color
**current moderate or heavy drinker was defined as 50 g/day, which is equivalent to
500 mL or two glasses of wine, 1000 mL of beer or 5 units of spirits, or being intoxicated
at least once a week.
Data and blood sample collection
Patient selection and blood collection were performed by neurologists and nurses,
respectively. Research nurses invited the controls to enroll from among the accompanying
persons willing to participate who fulfilled the inclusion criteria. After clarifying
the aims of the JSB and the corresponding roles of all parties involved, blood sampling
(phase I: 12 mL by venous puncture employing common EDTA-containing vacuum tubes;
phase II: finger puncture followed by drying and storage on an FTA Elute Card®, Whatman, Kent, United Kingdom) of patients and controls was performed by research
nurses. Baseline data included self-declared skin color, collected in strict accordance
with the mandatory procedures established by the Brazilian Institute of Geography
and Statistics for purposes of population census. Demographics, socio-economic data,
cardiovascular risk factors, National Institutes of Health Stroke Scale (clinical
stroke severity) and functional outcome can be extracted from the Joinville Stroke
Registry, which also includes causes and dates of death. The follow-up of patients
is being carried out by phone calls by previously-trained research nurses in one,
three, six, nine and 12 months in the first year and once a year for the following
four years after stroke diagnosis. Patients are asked about regular visits to a general
physician, blood pressure control, results of glycated hemoglobin and cholesterol
levels, and smoking habits.
An identification code, which comprises numbers that denote the date of recruitment,
contributing site and entry order, was assigned to each patient enrolled. Controls
were identified using numbers that keep their relationship with each matched patient.
These identification codes link clinical data and demographics with deposited DNA
samples and only the biobank managers or authorized personnel have access to the key
linking samples and phenotypic data.
Processing of samples, DNA extraction and quality control
Blood samples from phase I were maintained at 4°C until DNA extraction. Whole blood
samples were centrifuged for 10 minutes at 2,500 x g for separation of the buffy coat,
which contains most of the white blood cells. Thereafter, genomic DNA was obtained
by the classic “phenol-chloroform” procedure for total DNA extraction. The DNA was
resuspended in 200 μL of buffer TE (10 mM Tris-HCl; 1 mM EDTA; pH 8.0). For phase
II, we adopted the procedure for large-amount DNA extraction, starting with three
paper punches of 3 mm circles, as recommended by the manufacturer of the FTA Elute
Card®. The purity and yield of DNA extracted were evaluated by spectrophotometry at 260
and 280 nm (Epoch, Biotek Instruments, Winooski, USA). Long-term storage was considered
for A260/A280 ratios in the range 1.8–2.0. Finally, DNA was stored in aliquots at −80°C.
Ethical considerations
The JSB development plan and the informed consent forms were both approved by the
Institutional Review Board of the University of the Region of Joinville and by the
Brazilian National Commission for Ethics in Research (protocol 25000.142907/2013-07),
which holds the final approval prerogative for biobanks in Brazil. The adopted standards
and procedures strictly follow the ethical rules that came into force in 2011 and
are described in Resolution 441/11 of the National Health Council and in Ordinance
2.201 of the Brazil Ministry of Health, which correspond to the national guidelines
for biorepositories and biobanks of human biological material for research purposes.
The JSB also follows the international ethical regulations. All volunteers (patients
and controls) are enrolled after written informed consent has been given freely by
the participants or their legal representatives. The freedom to refuse participation
or withdraw at any time is guaranteed, and the confidentiality of personal data is
ensured in all circumstances. Free access to information and associated implications
is guaranteed to participants, thereby respecting the expression of individual will.
RESULTS
Since 2010, a total of 5,970 DNA samples have been registered belonging to patients
(2,688) and controls (3,282). The mean ages of patients and controls were 64.6 ± 19.3
and 56.9 ± 14.9 years, respectively. Men were more represented among the patients
(64%; 1,731/2,688), while women were more prevalent among controls (68%; 2,226/3,282).
[Table 2] shows the distribution of patients and controls per city, as well as their self-declared
skin color. White individuals predominated in the south (88%) and south-east (55%)
regions and brown in the north-east (53%) and central (41%) regions.
Table 2
Baseline characteristics of stroke patients and controls in the Joinville Stroke Biobank
database.
City
|
Campo Grande
|
Canoas
|
Joinville
|
Sertãozinho
|
Sobral
|
All
|
Demography
|
|
|
|
|
|
|
Cases (n)
|
507
|
187
|
1619
|
175
|
200
|
2688
|
Mean age (±SD) (years)
|
65.2 (14.9)
|
64.5 (14.3)
|
64.3 (14.1)
|
64.9 (15.1)
|
6 7.1 (16.1)
|
64.6 (19.3)
|
Male [n(%)]
|
290 (57.2)
|
100 (53.5)
|
1185 (73.2)
|
56 (32)
|
100 (50)
|
1731 (64.4)
|
Skin color [% (n/N)]
|
|
|
|
|
|
|
|
White
|
46.9
|
49.7
|
88.2
|
55.4
|
29.5
|
71.2
|
(238/507)
|
(93/187)
|
(1428/1619)
|
(97/175)
|
(59/200)
|
(1915 /2688)
|
|
Black
|
7.7
|
16.0
|
5.4
|
13.1
|
16.0
|
7.8
|
(39/507)
|
(30/187)
|
(87/1619)
|
(23/175)
|
(32/200)
|
(211/2688)
|
|
Brown
|
41.4
|
20.9
|
6.0
|
29.1
|
53.5
|
18.8
|
(210/507)
|
(39/187)
|
(97/1619)
|
(51/175)
|
(107/200)
|
(504/2688)
|
|
Native American
|
2.0
|
2.7
|
0.2
|
1 .1
|
0.5
|
0.8
|
|
(10/507)
|
(5/187)
|
(4/1619)
|
(2/175)
|
(1/200)
|
(22/2688)
|
|
Yellow
|
1. 4
|
0.5
|
0 .1
|
1 .1
|
0.5
|
0.5
|
|
(7/507)
|
(1/187)
|
(2/1619)
|
(2/175)
|
(1/200)
|
(13/2688)
|
|
Not-declared
|
0.6
|
10.2
|
0 .1
|
0.0
|
0.0
|
0.9
|
|
(3/507)
|
(19/187)
|
(1/1619)
|
0.0
|
0.0
|
(23/2688)
|
Controls (n)
|
180
|
24
|
2586)
|
276
|
216
|
3282
|
Mean age (±SD) years)
|
50.2 (14.6)
|
64.56 (13.1)
|
68.3 (14.7)
|
50.2 (13,9)
|
66.3 (15,5)
|
56.9 (14.9)
|
Male [n(%)]
|
61 (33.8)
|
14 (58.3)
|
865 (33.5)
|
65 (23.5)
|
51 (23.6)
|
1056 (32.2)
|
Skin color [% (n/N)]
|
|
|
|
|
|
|
|
White
|
53.9
|
62.5
|
90.4
|
46.7
|
32.9
|
80.7
|
|
(97/180)
|
(15/24)
|
(2338/2586)
|
(129/276)
|
(71/216)
|
(2650/3282)
|
|
Black
|
0.6
|
25.0
|
2.7
|
6.9
|
6.5
|
3.4
|
|
(1/180)
|
(6/24)
|
(71/2586)
|
(19/276)
|
(14/216)
|
(111/3282)
|
|
Brown
|
40.0
|
8.3
|
6.4
|
44.2
|
4 7.7
|
14.2
|
|
(72/180)
|
(2/24)
|
(166/2586)
|
(122/276)
|
(103/216)
|
(465/3282)
|
|
Native American
|
0.6
|
4.2
|
0.0
|
0.4
|
0.9
|
0.2
|
|
(1/180)
|
(1/24)
|
(1/2586)
|
(1/276)
|
(2/216)
|
(6/3282)
|
|
Yellow
|
0.6
|
0.0
|
0 .1
|
1. 4
|
0.0
|
0.2
|
|
(1/180)
|
0
|
(3/2586)
|
(4/276)
|
0
|
(8/3282)
|
|
Not-declared
|
4.4
|
0.0
|
0.3
|
0.4
|
12.0
|
1. 3
|
|
(8/180)
|
0
|
(7/2586)
|
(1/276)
|
(26/216)
|
(42/3282)
|
[Table 3] shows the distribution of stroke diagnoses of patients in the different cities.
Most of the patients (76%; 2,031/2,688) were IS, followed by 12% (319/2,668) transient
ischemic attacks, 9% (243/2,668) hemorrhagic stroke and 3% (95/2,688) subarachnoid
hemorrhage. A quarter of all stroke patients were aged ≤ 55 years.
Table 3
Major stroke diagnosis of patients in the Joinville Stroke Biobank database per city
[% (n/N)].
Variable
|
City
|
Campo Grande
|
Canoas
|
Joinville
|
Sertãozinho
|
Sobral
|
All
|
IS
|
16.5 (335/2031)
|
7. 5 (152/2031)
|
60.9 (1238/2031)
|
5.6 (112/2031)
|
9.5 (194/2031)
|
75.5 (2031/2688)
|
TIA
|
20.6 (66/319)
|
1.6 (5/319)
|
73.1 (233/319)
|
4.7 (15/319)
|
0
|
11.9 (319/2688)
|
HS lobar
|
-
|
-
|
11.7 (28/243)
|
0
|
-
|
-
|
HS non-lobar
|
-
|
-
|
7. 8 (19/243)
|
0
|
-
|
-
|
HS non-classified
|
34.6 (84/243)
|
11.1 (27/243)
|
20.1 (49/243)
|
13.9 (34/243)
|
0.8 (2/243)
|
9.1 (243/2688)
|
SAH
|
23.1 (22/95)
|
3.5 (3/95)
|
54.6 (52/95)
|
14.6 (14/95)
|
4.2 (4/95)
|
3.5 (95/2688)
|
Young adult (≤ 55 years old)
|
4.3 (118/2688)
|
0.4 (13/2668)
|
15.7 (424/2688)
|
1.9 (52/2688)
|
1.9 (53/2688)
|
24.5 (660/2688)
|
Nuclear cases
|
0
|
0
|
0.4 (7/1619)
|
0
|
0
|
0.4 (7/2688)
|
All cases
|
18.9 (507/2688)
|
6.9 (187/2688)
|
60.2 (1619/2688)
|
6.5 (175/2688)
|
7. 5 (200/2688)
|
2688
|
IS: ischemic stroke; TIA: transient ischemic attack; HS: hemorrhagic stroke; SAH:
subarachnoid hemorrhage.
The distribution of IS cases by TOAST criteria ([Table 4]) was: 20% (426/2,031) cardioembolic; 22% (440/2,031) small-artery occlusion; 16%
(317/2,031) large-artery atherosclerosis; 40% undetermined (807/2,031) and 2% (41/2,031)
other determined cause. After 2012, patients were also classified by CCS criteria
(56%; 1,503/2,688). As expected, among IS patients, the most common IS subtype was
undetermined, not only by TOAST criteria (40%; 807/2,031), but also by CCS criteria
(47%; 718/1,503). Out of those 807 undetermined IS patients classified by TOAST criteria,
55% (446/807) were patients with an incomplete evaluation, 42% (340/807) were cryptogenic
and 3% (21/807) were undetermined with two or more possible causes. Out of those 718
undetermined IS subtypes classified by CCS criteria, 25% (370/1,503) were IS patients
with incomplete evaluation, 17% (262/1,503) were unknown other cryptogenic, 8% (61/1,503)
were unknown cryptogenic embolism and 1% (25/1,503) were unclassified.
Table 4
Ischemic stroke subtypes diagnosis of patients in the Joinville Stroke Biobank database
[% (n/N)].
Variable
|
City
|
Campo Grande
|
Canoas
|
Joinville
|
Sertãozinho
|
Sobral
|
All
|
IS subtype by TOAST
|
|
|
|
|
|
|
|
Large-artery atherosclerosis
|
6.9 (22/317)
|
5 .1 (16/317)
|
81.8 (259/317)
|
2.8(9/317)
|
3.4 (11/317)
|
15.8 (317/2031)
|
|
Cardioembolic
|
12.2 (52/426)
|
3.2 (14/426)
|
78.3 (333/426)
|
2.8(12/426)
|
3.5 (15/426)
|
20.2 (426/2031)
|
|
Small-artery occlusion
|
17.1 (75/440)
|
4.3 (19/440)
|
6.9 (307/440)
|
0.9(4/440)
|
7. 9 (35/440)
|
21.7 (440/2031)
|
|
Other determined cause
|
17.1(7/41)
|
0
|
82.9 (34/41)
|
0
|
0
|
2.2(41/2031)
|
Undetermined
|
|
|
|
|
|
|
|
Cryptogenic/ESUS
|
5.6 (19/340)
|
16.2 (55/340)
|
51.8 (176/340)
|
2.6 (9/340)
|
23.8 (81/340)
|
16.8 (340/2031)
|
|
Incomplete investigation
|
35.6(159/446)
|
10.8 (48/446)
|
24.6 (110/446)
|
17.5 (78/446)
|
11.4 (51/446)
|
22.1 (446/2031)
|
Two or more determined causes
|
4.7 (1/21)
|
0
|
90.6 (19/21)
|
0
|
4.7 (1/21)
|
1.2 (21/2031)
|
IS subtype by CCS
|
|
|
|
|
|
|
|
Large-artery atherosclerosis
|
|
|
|
|
|
|
|
|
Evident
|
5.3 (8/150)
|
10.7 (16/150)
|
76.7 (115/150)
|
0
|
7. 3 (11/150)
|
9.9 (150/1503)
|
|
|
Probable
|
0
|
0
|
92.1(35/38)
|
7.9(3/38)
|
0
|
2.4 (38/1503)
|
|
|
Possible
|
12(3/25)
|
0
|
64 (16/25)
|
24 (6/25)
|
0
|
1.5 (25/1503)
|
Cardioembolic
|
|
|
|
|
|
|
|
Evident
|
11.7 (18/154)
|
7.1 (11/154)
|
75.3 (116/154)
|
3.2 (5/154)
|
2.6 (4/154)
|
10.2 (154/1503)
|
|
Probable
|
0
|
0
|
100 (40/40)
|
0
|
0
|
2.5 (40/1503)
|
|
Possible
|
15.7 (8/51)
|
0
|
49.1 (25/51)
|
13.6(7/51)
|
21.6 (11/51)
|
3.3 (51/1503)
|
|
Small-artery occlusion
|
|
|
|
|
|
|
|
|
Evident
|
6.8(12/175)
|
10.3 (18/175)
|
6.5 (113/175)
|
5.7(1/175)
|
17.7 (31/175)
|
11.5 (175/1503)
|
|
|
Probable
|
15.1 (10/66)
|
0
|
8.2(54/66)
|
3 .1 (2/66)
|
0
|
4.3 (66/1503)
|
|
|
Possible
|
2.1(1/48)
|
0
|
87.5 (42/48)
|
2.1(1/48)
|
8.3(4/48)
|
3.1 (48/1503)
|
|
Other determined cause
|
|
|
|
|
|
|
|
|
Evident
|
11.1 (4/36)
|
0
|
86.2(31/36)
|
0
|
2.7 (1/36)
|
2.3 (36/1503)
|
|
|
Probable
|
0
|
0
|
100(2/2)
|
0
|
0
|
1.2 (2/1503)
|
|
|
Possible
|
0
|
0
|
0
|
0
|
0
|
0
|
Undetermined
|
|
|
|
|
|
|
|
|
Unknown cryptogenic embolism
|
0
|
0
|
80.3(49/61)
|
0
|
19.7 (12/61)
|
4.1 (61/1503)
|
|
|
Unknown other cryptogenic
|
6.1 (16/262)
|
20.2(53/262)
|
44.6 (117/262)
|
26.7(7/262)
|
26.3 (69/262)
|
17.4 (262/1503)
|
|
|
Incomplete evaluation
|
27.6 (102/370)
|
12.9 (48/370)
|
24.3 (90/370)
|
21.3 (79/370)
|
13.8 (51/370)
|
24.6 (370/1503)
|
|
|
Unclassified
|
4(1/25)
|
24(6/25)
|
64 (16/25)
|
4 (1/25)
|
4 (1/25)
|
1.7 (25/1503)
|
Total IS by TOAST (since 2010)
|
16.5 (335/2031)
|
7. 5 (152/2031)
|
60.9 (1238/2031)
|
5.6 (112/2031)
|
9.5 (194/2031)
|
75.5 (2031/2688)
|
Total IS by CCS (since 2012)
|
12.2 (183/1503)
|
10.1 (152/1503)
|
57.3 (861/1503)
|
74.5 (112/1503)
|
12.9 (195/1503)
|
72. 0(1503/2100)
|
IS: ischemic stroke; TOAST: Trial of Org 10172 in Acute Stroke Treatment criteria16;
ESUS: Embolic Strokes of Undetermined Source; CCS: Causative Classification System
criteria17.
DISCUSSION
Up to October 2016, DNA from 5,970 blood samples distributed among patients (2,688)
and controls (3,282) had been extracted and stored. Since it is believed that most
of the genetic variants identified in stroke studies will have small, individual effects
on disease risk, efforts aimed at the successful discovery of the impact of genes
have required large sample sizes (usually involving tens or hundreds of thousands
of cases and controls) to achieve sufficient statistical power[10]. In addition, as with any case-control study, controls should be representative
of cases[17],[18].
With this in mind, the JSB comprises well-defined phenotypes and samples of high-quality
DNA. The priority recruitment of spouses or partners as case-matched controls represents
a suitable strategy for minimizing differences due to environmental effects to which
the participants (patients and controls) have been exposed. As controls are collected
in the same geographic region as patients, it is reasonable to expect a homogeneous
distribution of ethnic backgrounds between patients and controls. On the other hand,
despite having inflated influences due to shared variants and environmental effects
between patients and controls, the JSB may also provide relevant information on familial
aggregation analysis. It is also noteworthy to clarify that the JSB has a forecast
of unlimited continuity as, among other factors, the Joinville Stroke Registry is
supported by a municipal law. This gives rise to the fact that if there are cases
of stroke in individuals previously categorized as a control (since stroke events
may happen later in life), their biobank status changes, but maintains the traceability
of the change. This allows the possibility of evaluation of the impact on results
of any studies already performed with the data and samples of those individuals who
no longer belong to the control group. As far as we know, this is the first stroke
biobank in Latin America, a region with a massive race mixture.
Under TOAST, IS cases are assigned to one of three main subtypes (small-artery occlusion,
large-artery atherosclerosis, cardioembolic); to a rare cause (e.g., carotid or vertebral
artery dissection); or remain undetermined (because of more than one potential cause,
incomplete investigation or no apparent cause despite complete investigation/cryptogenic
stroke). The proportion in the undetermined category varies, but can be substantial
(up to 40% or more), and is usually excluded from ischemic subtype-specific analyses[19],[20]. In our cohort, between 22% (TOAST criteria) and 25% (CCS criteria) were undetermined
IS due to incomplete investigation. All patients with incomplete investigation were
assigned this diagnosis. In fact, the highly diverse sociodemographic scenarios at
sites contributing to the JSB, including difficulties related to health-care system
access and imaging diagnostic workup, might have influenced significantly. Nevertheless,
independent raters may perform blinded readjudications of all JSB IS diagnoses whenever
considered necessary.
Globally, large DNA biobanks have been contributing to the understanding of the different
genetic architectures and pathophysiological aspects of stroke. In this regard, the
UK Biobank (European and British Asian), Brains-SA (India and Sri Lanka), Bio-Repository
of DNA in Stroke – Middle East (Qatar)[21], China Kadoorie Biobank[22] and other collaborative initiatives, such as the International Stroke Genetics Consortium,
have contributed to the identification of several loci associated with IS subtypes.
Genome-wide association study approaches have been adopted for investigating many
complex diseases, such as stroke, and have been effective in identifying new genetic
variants associated with the risk of disease. However, for stroke studies, genome-wide
association studies have so far been applied principally in European, North American
and Japanese populations[3],[23].
In particular, four loci (PITX2, HDAC9, ZFHX3 and 12q24.2) were convincingly implicated by genome-wide association studies as associated
with IS in European-ancestry patients[24],[25]. Recently, a novel locus (rs12122341) was identified at 1p13.2 (near TSPAN2), associated with large-artery atherosclerosis stroke, and the locus 12q24 (near
ALDH2) was associated with small-artery occlusion[26]. In addition, the gene TRPV3 was associated with cardioembolic stroke after replication of exome sequencing analysis,
demonstrating that two polymorphisms were associated with cardioembolic stroke risk
in two cohorts, with rs151091899 being the most significant[27]. Therefore, replication studies in other populations are determinants in understanding
the significance of such polymorphisms as stroke risk factors. The Brazilian population
is one of the most heterogeneous populations in the world, comprising an admixture
of Native Americans, Europeans and Africans[28]. The admixture process occurred through different means in Brazilian geographic
regions. For instance, the Native American contribution is more pronounced in northern
Brazil, the African contribution is more elevated in the north-east, and the south
features a European predominance with few Native American and African influences[29]. Consequently, the JSB, unlike other biobanks[8],[22],[23], comprises an ideal admixed population with great ethnic variability linked to epidemiological
and clinical data, which may allow confirmation of polymorphisms found in specific
populations, contributing to global understanding of genetic mechanisms involved in
stroke.
In conclusion, the clinical, environmental and epidemiological issues related to the
different types of stroke, combined with the large genetic variability captured by
the JSB, may increase the chance of identifying new and relevant factors predisposing
individuals to stroke (and recovery) in Brazil. Our aim is to continue patient assessment
and blood collection, as well as begin genotyping samples in the near future. Some
initial studies are already being conducted in the Laboratory of Molecular Genetics
at the University of Campinas with the support of the Brazilian Institute of Neuroscience
and Neurotechnology and the use of a genome-wide association genotyping platform (Affymetrix™
6.0).