Explanations
proper nouns
verbs indicating existence or presence
Negative Logits
ank       -0.79
imar      -0.71
isman     -0.68
exceeds   -0.68
author    -0.66
wake      -0.65
belie     -0.65
hadn      -0.65
avoids    -0.64
Author    -0.64
Positive Logits
GROUND    0.72
ãĥ¼ãĥ³    0.69
Remy      0.68
ãĤĮ       0.65
tnc       0.65
FTWARE    0.64
Sanct     0.64
gaping    0.64
ominous   0.62
beaut     0.61
Activations Density: 0.423%