INDEX
Explanations
phrases focused on various forms of critical and analytical thinking
New Auto-Interp
Negative Logits
chnitt
-0.18
idth
-0.17
509
-0.16
ानन
-0.16
crest
-0.15
ibu
-0.15
@}
-0.15
annah
-0.15
insky
-0.15
zcze
-0.14
POSITIVE LOGITS
_ENUM
0.16
audi
0.15
quin
0.15
ione
0.14
labs
0.14
iku
0.14
Ùħست
0.14
exercises
0.14
abstract
0.14
ToProps
0.14
Activations Density 0.015%