Explanations
phrases describing contrasts or differences between entities or conditions
instances of the word "and" indicating conjunctions or connections between different ideas or concepts
Negative Logits
Token       Logit
Citation    -0.78
Voice       -0.71
@#&         -0.71
elle        -0.70
hei         -0.70
Dri         -0.69
iard        -0.69
ãĤ¨         -0.68
yn          -0.68
)</         -0.68
Positive Logits
Token       Logit
ours        0.96
those       0.80
actual      0.79
theirs      0.76
hence       0.72
yours       0.72
downright   0.70
ones        0.67
customary   0.67
reality     0.66
Activations Density: 0.131%