INDEX
Explanations
additional information or conditions
Negative Logits
evokes        1.09
paves         1.09
heartwarming  1.06
exudes        1.06
deliciously   1.05
endearing     1.04
fosters       1.02
vibes         0.99
boasts        0.98
rhetoric      0.96
Positive Logits
Furthermore   1.24
Additionally  1.22
Also          1.19
Moreover      1.17
Note          1.12
If            1.09
Specifically  1.08
如果 ("if")   1.06
Furthermore   1.06
Additionally  1.05
Activations Density 0.205%