INDEX
Explanations
adjectives and verbs related to physical appearance and actions
phrases indicating conditions, assessments, or choices
New Auto-Interp
Negative Logits
______
-0.62
.''.
-0.57
âĢİ
-0.54
¶
-0.53
antioxid
-0.53
gmaxwell
-0.53
thence
-0.53
↵
-0.52
pursuant
-0.52
Jihad
-0.52
POSITIVE LOGITS
Đ
1.19
ù
1.19
RandomRedditor
1.19
Ă
1.19
ă
1.19
ø
1.19
đ
1.19
ė
1.19
Ě
1.19
û
1.19
Activations Density 1.515%