INDEX
Explanations
phrases related to intelligence or cleverness
New Auto-Interp
Negative Logits
bered
-0.74
atern
-0.72
ICAN
-0.69
packages
-0.68
avez
-0.67
artifacts
-0.66
dates
-0.64
leased
-0.64
ETA
-0.63
avored
-0.62
POSITIVE LOGITS
thinker
1.00
sonian
0.93
ctl
0.92
minded
0.85
smanship
0.84
thinkers
0.84
guy
0.83
found
0.81
smartest
0.79
ness
0.76
Activations Density 0.993%