INDEX
Explanations
references to intelligence and cleverness
Negative Logits
ToBounds  -0.15
wards     -0.15
apia      -0.15
rones     -0.14
andum     -0.14
ignKey    -0.14
entes     -0.14
endale    -0.14
phans     -0.14
tega      -0.14
Positive Logits
èģ        0.18
Pearce    0.17
complex   0.14
ifter     0.14
.hm       0.14
ivi       0.14
con       0.14
Smarty    0.14
complex   0.14
ende      0.14
Activations Density: 0.026%