INDEX
Explanations
instances of variation in language and tone, reflecting a casual or informal style
Negative Logits
égration       -0.36
efectivamente  -0.31
Bühne          -0.31
intégrée       -0.31
castigo        -0.31
either         -0.30
rangian        -0.30
either         -0.30
automatiques   -0.30
чти            -0.29
Positive Logits
einf           0.70
plain          0.68
Plain          0.67
Plain          0.66
simply         0.64
styleType      0.62
חיצוניים       0.61
general        0.60
saraba         0.60
Exacts         0.59
Activations Density: 0.427%