INDEX
Explanations
phrases related to global or worldwide entities or concepts
Negative Logits
ĪĴ      -0.84
ADRA    -0.77
¬¼      -0.74
regon   -0.73
hift    -0.73
zes     -0.72
POR     -0.71
zin     -0.71
okane   -0.71
irc     -0.69
Positive Logits
(~          0.74
besides     0.74
according   0.71
today       0.69
compared    0.67
âĶ          0.66
bestowed    0.65
excluding   0.64
surpassed   0.64
EVER        0.63
Activations Density 0.112%