INDEX
Explanations
verbs expressing action or judgement
phrases related to criticism and accountability
New Auto-Interp
Negative Logits
inger
-0.72
ilogy
-0.68
ciating
-0.68
talking
-0.65
Depending
-0.64
DragonMagazine
-0.64
Reporting
-0.63
gui
-0.62
thanking
-0.60
orget
-0.58
POSITIVE LOGITS
insensitive
0.77
BuyableInstoreAndOnline
0.74
sake
0.72
improper
0.70
ãĤ¤ãĥĪ
0.69
insufficient
0.68
unsu
0.65
purposes
0.65
illegal
0.64
unconventional
0.64
Activations Density 0.176%