INDEX
Explanations
instances of the word "similar" in the text
references to similar events or situations
New Auto-Interp
Negative Logits
OST
-0.79
arden
-0.79
UME
-0.70
phasis
-0.69
ribution
-0.67
hest
-0.66
oway
-0.65
@@@@
-0.65
wood
-0.65
Beans
-0.63
POSITIVE LOGITS
vein
1.11
sized
0.96
worldly
0.95
fate
0.94
minded
0.93
minded
0.91
amounts
0.89
sentiments
0.88
twins
0.83
lihood
0.82
Activations Density 0.028%