Explanations
mentions of specific names or surnames, especially when followed by numbers
repeated mentions of a particular name or term associated with people or entities
Negative Logits
Token       Logit
glim        -0.75
Subject     -0.73
IDENT       -0.73
glass       -0.71
narrator    -0.70
Þ           -0.69
à¨          -0.68
icist       -0.66
IAL         -0.66
iPads       -0.66
Positive Logits
Token       Logit
zzi         1.26
ppa         1.06
zzo         1.03
pps         1.03
ichi        0.96
eta         0.93
opa         0.92
ÅĤ          0.92
ek          0.91
ven         0.91
Activations Density: 0.008%
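
For scale, a density of 0.008% corresponds to roughly 8 active tokens per 100,000. The sketch below shows one way such a figure could be computed, assuming density means the fraction of tokens on which the feature's activation is above zero. That definition, the function name `activation_density`, and the sample data are illustrative assumptions, not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: "density" is the share of tokens where the feature fires.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 8 active tokens out of 100,000.
sample = [0.0] * 99_992 + [1.5] * 8
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this assumed definition, a feature this sparse fires on only a handful of tokens in any large sample, which fits the narrow name-like pattern described in the explanations.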