INDEX
Explanations
names of individuals involved in criminal activities
proper nouns, specifically names of individuals
New Auto-Interp
Negative Logits
mathemat
-0.74
izoph
-0.70
soDeliveryDate
-0.70
Temperature
-0.67
favourable
-0.67
hemisphere
-0.64
patience
-0.63
pathological
-0.63
translation
-0.63
Compared
-0.63
POSITIVE LOGITS
Jr
1.44
III
1.18
Sr
1.07
vich
0.94
burgh
0.93
iewicz
0.91
baum
0.91
aka
0.90
owski
0.89
icz
0.88
Activations Density 0.250%