INDEX
Explanations
phrases that emphasize responsible behavior and actions
responsible and prudent action
Negative Logits
coû             -0.34
terminée        -0.32
gratuitement    -0.31
Fleisch         -0.31
differ          -0.31
réun            -0.31
Hofmann         -0.31
réjou           -0.29
difer           -0.29
entièrement     -0.29
Positive Logits
autorytatywna   1.00
ſelves          0.84
prudent         0.79
Perſ            0.74
Autoritní       0.74
transférez      0.73
prudence        0.71
OMITBAD         0.71
Conſ            0.71
للاسماء         0.69
Activations Density 0.011%