Explanations
expressions of difficulty or challenge
Negative Logits
Token       Logit
Äįel        -0.18
çľī         -0.16
oya         -0.15
irit        -0.15
aku         -0.14
iren        -0.14
instincts   -0.13
llum        -0.13
Interop     -0.13
areth       -0.13
Positive Logits
Token         Logit
imagine       0.50
picture       0.50
imag          0.49
imaging       0.49
imagination   0.47
image         0.46
Imaging       0.43
Imagine       0.43
Imagine       0.41
Picture       0.41
Activations Density: 0.203%