INDEX
Explanations
mentions of specific individuals or places
unique symbols or characters
New Auto-Interp
Negative Logits
comprom
-0.96
mathemat
-0.77
mosqu
-0.71
cob
-0.69
photoc
-0.69
interf
-0.68
Ambro
-0.68
Thomson
-0.66
Unic
-0.66
ensical
-0.65
POSITIVE LOGITS
Ļ
1.39
¬
1.22
¤
1.22
ħ
1.18
Ħ¢
1.15
ĺ
1.14
ı
1.12
¡
1.10
ª
1.10
Ķ
1.08
Activations Density 0.182%