INDEX
Explanations
references to diversity or variety in context
Negative Logits
ensee      -0.62
rachtet    -0.56
Vors       -0.53
hält       -0.53
Lue        -0.52
हा         -0.52
Wies       -0.51
it         -0.51
zwy        -0.51
eaky       -0.51
Positive Logits
تضيفلها          0.93
various          0.92
zości            0.90
assorted         0.88
>=",             0.87
various          0.86
varied           0.84
sundry           0.84
verschiedener    0.84
assorted         0.83
Activations Density: 0.116%