INDEX
Explanations
words that indicate confirmation or agreement
New Auto-Interp
Negative Logits
Middle
-0.52
ditemui
-0.47
خارجية
-0.47
Middle
-0.47
or
-0.46
sementara
-0.46
aidé
-0.45
BrowserModule
-0.45
ırl
-0.45
combina
-0.44
POSITIVE LOGITS
о
1.13
об
1.10
IntoConstraints
0.92
عن
0.92
apie
0.86
về
0.85
Για
0.81
Tentang
0.80
despre
0.80
ostavi
0.79
Activations Density 0.051%