Explanations
references to real-life events and situations
Negative Logits
istrovství    -0.17
xfb           -0.15
etxt          -0.14
iant          -0.14
iales         -0.14
YYS           -0.14
conds         -0.14
cept          -0.14
éric          -0.14
stvo          -0.13
Positive Logits
counterpart     0.18
imony           0.16
eco             0.15
gear            0.15
counterparts    0.15
bat             0.14
augment         0.14
like            0.14
occurrences     0.14
.opensource     0.14
Activations Density 0.046%