INDEX
Explanations
words related to decision-making and contemplation
terms related to personality traits or characteristics
New Auto-Interp
Negative Logits
ONSORED
-0.82
Rossi
-0.76
ctrl
-0.71
-+-+
-0.70
20439
-0.65
Firefly
-0.64
Ľ
-0.63
Ferdinand
-0.62
eger
-0.62
Reloaded
-0.61
POSITIVE LOGITS
onna
0.65
owship
0.64
cular
0.63
aining
0.63
ained
0.62
angled
0.61
sen
0.61
gencies
0.61
iating
0.60
ATA
0.59
Activations Density 0.000%