INDEX
Explanations
pairs of opposites or contrasts in text
instances of the word "differently" and its related contexts
New Auto-Interp
Negative Logits
urance
-0.77
publication
-0.75
stay
-0.69
subsequ
-0.69
raltar
-0.66
passage
-0.65
comprehension
-0.63
supremacy
-0.62
Mouth
-0.62
————————
-0.61
POSITIVE LOGITS
situated
1.04
disposed
1.02
inclined
1.00
tuned
0.92
configured
0.90
selected
0.89
suited
0.87
impacted
0.87
graded
0.86
affected
0.84
Activations Density 0.044%