INDEX
Explanations
specific forms of adjectives and verbs related to qualities and characteristics
Negative Logits
+#+#            -0.65
חיצוניים        -0.61
vérit           -0.60
GENERATED       -0.59
geda            -0.59
PerformLayout   -0.59
rungsseite      -0.57
:✨              -0.57
godz            -0.56
terecht         -0.56
Positive Logits
<bos>           0.77
AsUp            0.60
+:+             0.55
ImGui           0.54
UnusedPrivate   0.54
xious           0.54
Rhestr          0.52
TextAppearance  0.52
Mechan          0.51
Trimethyl       0.50
Activations Density: 0.598%