INDEX
Explanations
comparative words and phrases
phrases indicating comparisons or negations of quantity and condition
New Auto-Interp
Negative Logits
?????
-0.71
EVA
-0.69
vows
-0.64
alion
-0.64
virginity
-0.62
Chemistry
-0.61
Advertisement
-0.60
Hom
-0.59
Vag
-0.56
anny
-0.56
POSITIVE LOGITS
resembling
0.81
rir
0.74
»Ĵ
0.73
describ
0.71
icter
0.71
seen
0.71
lio
0.69
aring
0.69
Called
0.68
yond
0.68
Activations Density 0.444%