INDEX
Explanations
phrases starting with "Not so" and "Much"
phrases that indicate a negation or contradiction
New Auto-Interp
Negative Logits
ãĥ¼ãĥĨãĤ£
-0.76
ãĤ¼ãĤ¦ãĤ¹
-0.66
MAP
-0.63
DRAGON
-0.59
idential
-0.57
SHARES
-0.57
Places
-0.56
Unable
-0.56
Maps
-0.56
{:-0.55
POSITIVE LOGITS
anymore
0.89
much
0.88
ppy
0.83
othes
0.80
apy
0.80
imilar
0.76
oths
0.74
igne
0.71
aked
0.71
oooo
0.71
Activations Density 0.040%