INDEX
Explanations
words related to theft or burglary
references to the city of Burgos
NEGATIVE LOGITS
Token       Logit
••          -0.79
Gemini      -0.72
partName    -0.68
filibuster  -0.63
Apollo      -0.62
*/(         -0.61
hift        -0.61
hypers      -0.58
needle      -0.57
=-=-=-=-    -0.56
POSITIVE LOGITS
Token       Logit
undy         1.25
Burg         1.15
burg         1.09
dor          0.95
rats         0.90
lar          0.90
erville      0.90
hers         0.79
ansk         0.79
nut          0.79
Activations Density: 0.003%