INDEX
Explanations
occurrences of the word "exchange" and its variants in various contexts
New Auto-Interp
Negative Logits
li
-0.16
amate
-0.15
azzo
-0.15
odore
-0.15
fil
-0.15
led
-0.14
anners
-0.14
ouden
-0.14
¬
-0.14
arga
-0.14
POSITIVE LOGITS
frau
0.17
anter
0.17
ept
0.15
esin
0.15
able
0.15
istrovstvÃŃ
0.15
ois
0.14
esa
0.14
iros
0.14
CompleteListener
0.14
Activations Density 0.019%