INDEX
Explanations
phrases related to significant events or changes in life
New Auto-Interp
Negative Logits
ãĥ§
-0.15
ulta
-0.15
нÑĮ
-0.15
TRACE
-0.14
.CodeAnalysis
-0.14
ãĥ¼ãĥIJ
-0.13
Tub
-0.13
inski
-0.13
çķª
-0.13
ivol
-0.13
POSITIVE LOGITS
Blue
1.62
blue
1.51
Blue
1.50
BLUE
1.40
blue
1.34
-blue
1.29
BLUE
1.21
/blue
1.13
_blue
1.10
èĵĿ
1.10
Activations Density 0.233%