INDEX
Explanations
phrases related to guidance, persuasion, or decision-making
specific emotional indicators or expressions of urgency
New Auto-Interp
Negative Logits
mathemat
-0.74
Jericho
-0.73
Sag
-0.69
Naked
-0.69
fortun
-0.69
Shap
-0.66
Tek
-0.63
Voyager
-0.62
Forth
-0.61
chart
-0.61
POSITIVE LOGITS
Ļ
1.35
¬
1.21
ľ
1.14
¢
1.12
ı
1.11
>>
1.09
ĸ
1.08
Ħ¢
1.07
--+
1.05
âĪĴ
1.04
Activations Density 0.408%