INDEX
Explanations
adverbs indicating similarity or comparison
phrases that indicate comparisons or similarities
New Auto-Interp
Negative Logits
Score
-0.62
"},"
-0.62
ocene
-0.60
————————
-0.59
aughs
-0.56
\/\/
-0.55
/"
-0.55
@@@@
-0.55
stay
-0.54
http
-0.54
POSITIVE LOGITS
situated
0.89
minded
0.79
minded
0.73
,
0.67
sized
0.66
apy
0.66
inclined
0.65
quartered
0.65
leep
0.65
importantly
0.64
Activations Density 0.033%