INDEX
Explanations
phrases indicating possession or specification of items or concepts
New Auto-Interp
Negative Logits
SOUNDBITE
-0.56
Gizmos
-0.55
Shimizu
-0.47
Controllo
-0.44
some
-0.44
△
-0.43
adaptiveStyles
-0.43
something
-0.42
etwas
-0.42
pushFollow
-0.42
POSITIVE LOGITS
argint
0.65
MANY
0.63
many
0.61
Many
0.60
Many
0.60
vostri
0.57
meisten
0.56
majority
0.56
flesta
0.56
maioria
0.56
Activations Density 0.009%