Explanations
adjectives related to physical attributes or actions
positive adjectives and adverbs that convey a sense of safety or improvement
Negative Logits
ITNESS      -0.74
vous        -0.72
Quantity    -0.68
iph         -0.67
••••        -0.65
Divinity    -0.64
akings      -0.59
ROR         -0.58
ット        -0.55
Difference  -0.55
Positive Logits
aneously    1.29
heartedly   1.09
handedly    1.05
enough      1.03
ly          1.01
edly        0.94
istically   0.90
rily        0.90
lly         0.87
distances   0.84
Activations Density 0.250%