INDEX
Explanations
names or terms related to a specific individual
the mention of specific names related to an individual involved in significant events
Negative Logits
rified     -0.74
mitting    -0.70
mort       -0.69
conscious  -0.69
ienced     -0.68
aminer     -0.68
filled     -0.67
tremend    -0.67
lasses     -0.66
cribed     -0.66
Positive Logits
Äĩ         0.92
yah        0.88
ason       0.82
ya         0.82
cki        0.82
asing      0.80
zon        0.77
Äį         0.77
ensis      0.74
ela        0.74
Activations Density 0.011%
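
Two of the positive-logit tokens, Äĩ and Äį, look like byte-level BPE vocabulary strings rather than literal text. The sketch below shows how such strings could be mapped back to readable characters. It assumes a GPT-2 style byte-to-unicode table, which the page does not confirm, and `decode_token` is a hypothetical helper name used only for illustration.

```python
def bytes_to_unicode():
    """Build the GPT-2 style map from byte value to a printable character.

    Assumption: the model behind this page uses this byte-level scheme.
    """
    # Bytes that are already printable keep their own code point.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    offset = 0
    # Every remaining byte is shifted to an unused code point above 255.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return dict(zip(byte_values, (chr(c) for c in code_points)))


def decode_token(token):
    """Turn a vocabulary string back into the UTF-8 text it stands for."""
    char_to_byte = {ch: b for b, ch in bytes_to_unicode().items()}
    raw = bytes(char_to_byte[ch] for ch in token)
    # Tokens can end in the middle of a multi-byte character, so be lenient.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["\u00c4\u0129", "\u00c4\u012f", "yah"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, Äĩ decodes to ć and Äį decodes to č, while plain ASCII tokens such as yah are unchanged. Both decoded letters are common in surnames, which would fit the explanations listed above.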