Explanations
words related to reflection and evaluation of personal performance and progress
Negative Logits
Token    Logit
ILCS     -0.84
zens     -0.69
intest   -0.66
ophon    -0.64
itals    -0.63
ESH      -0.62
civil    -0.62
cyan     -0.61
loads    -0.60
unequ    -0.60
Positive Logits
Token       Logit
Regarding   0.89
regarding   0.85
naire       0.84
arial       0.82
zzle        0.80
umar        0.80
ially       0.80
concerning  0.76
naires      0.75
aloud       0.74
Activations Density: 9.620%