Explanations
words expressing positivity or greetings
"Good" followed by other words
Negative Logits
Token          Logit
ujednoznacz    -0.45
rungsseite     -0.42
ябре           -0.41
dule           -0.38
Access         -0.37
tissement      -0.37
Biographie     -0.37
internally     -0.36
Celle          -0.36
access         -0.35
Positive Logits
Token          Logit
Good           0.99
Good           0.93
GOOD           0.85
GOOD           0.83
good           0.82
good           0.81
dobré          0.73
buenas         0.68
goede          0.68
bonnes         0.66
Activations Density: 0.011%
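
The density figure can be read as the share of sampled tokens on which the feature fires at all. The sketch below illustrates that reading only; it assumes density means "percentage of tokens with activation above zero", and the function name `activation_density`, the threshold parameter, and the sample data are all hypothetical rather than taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the fraction of sampled tokens on which the
    feature is active, expressed as a percentage.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: 10,000 tokens, exactly one of which activates the feature.
sample = [0.0] * 9999 + [3.2]
print(f"{activation_density(sample):.3f}%")
```

Under this reading, a density of 0.011% would correspond to roughly 11 active tokens per 100,000 sampled.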