INDEX
Explanations
references to reflection and perception
New Auto-Interp
Negative Logits
Rede
-0.14
ÐłÐ¾Ð´
-0.14
opsis
-0.14
arken
-0.14
ruk
-0.13
icolon
-0.13
Rewrite
-0.13
Reusable
-0.13
erta
-0.13
akra
-0.12
POSITIVE LOGITS
reflection
0.80
-ref
0.79
reflect
0.78
reflect
0.75
reflected
0.72
refl
0.71
reflection
0.71
ref
0.71
ref
0.70
reflections
0.70
Activations Density 0.141%