INDEX
Explanations
mentions of things that are alike or resemble one another
instances of the word "similar" and its variations
New Auto-Interp
Negative Logits
arden
-0.77
arest
-0.75
hest
-0.72
hend
-0.71
OST
-0.69
Diamond
-0.68
ensibly
-0.66
Benz
-0.66
gemony
-0.65
ellen
-0.65
POSITIVE LOGITS
sized
1.17
sentiments
1.13
minded
1.06
fate
1.05
lihood
1.04
vein
1.03
phenomena
1.01
ities
0.99
worldly
0.97
phenomenon
0.95
Activations Density 0.035%