Explanations
references to significant events or conditions that impact various practices, beliefs, or situations
Negative Logits
ſelf            -0.88
ſelves          -0.85
houſe           -0.84
ſche            -0.79
Monfieur        -0.78
Majefty         -0.77
uſe             -0.77
domani          -0.76
RegressionTest  -0.76
:✨              -0.75
Positive Logits
since           1.23
lately          0.96
since           0.96
recent          0.94
been            0.94
recently        0.89
SINCE           0.88
depuis          0.85
Since           0.83
sejak           0.83
Activations Density: 0.789%