INDEX
Explanations
references to events or activities taking place at specific locations
occurrences of the phrase "take place."
Negative Logits
Token        Logit
apo          -0.72
edded        -0.67
rouse        -0.65
rc           -0.65
ooks         -0.65
incinn       -0.64
illard       -0.64
cest         -0.64
idth         -0.63
oneliness    -0.63
Positive Logits
Token        Logit
Ú            0.86
Ò            0.85
ÑĮ           0.77
ãĤ¯          0.77
ãĥĥãĤ¯       0.77
VK           0.75
Ö            0.74
bos          0.73
ãĤ¦          0.72
Parameter    0.71
Activations Density 0.022%
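
Several of the positive-logit tokens (for example ãĤ¯ and ÑĮ) look like the printable stand-ins that GPT-2-style byte-level BPE vocabularies use for raw bytes. The sketch below shows one way to map such strings back to readable text. It assumes the model behind this page uses that byte-to-character table, which the page itself does not state, and the helper name `decode_byte_level_token` is made up for illustration.

```python
def bytes_to_unicode():
    """Build the byte -> printable-character table used by GPT-2-style
    byte-level BPE. Assumption: the model on this page uses this scheme."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is assigned a code point starting at 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: printable character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_byte_level_token(token: str) -> str:
    """Hypothetical helper: turn a vocabulary string back into text.

    Tokens that hold only part of a multi-byte UTF-8 sequence cannot be
    decoded on their own, so they come back as U+FFFD.
    """
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĤ¯", "ãĥĥãĤ¯", "ãĤ¦", "ÑĮ", "Ú", "VK"]:
        print(repr(tok), "->", repr(decode_byte_level_token(tok)))
```

Under that assumption, ãĤ¯, ãĥĥãĤ¯ and ãĤ¦ decode to the katakana ク, ック and ウ, and ÑĮ decodes to the Cyrillic ь. Single characters such as Ú, Ò and Ö are lone UTF-8 lead bytes, so they have no standalone decoding.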