INDEX
Explanations
similes using the word "like"
New Auto-Interp
Negative Logits
Published
-0.89
inion
-0.77
hiba
-0.74
idates
-0.72
inas
-0.71
ittal
-0.71
endiary
-0.69
iets
-0.68
ione
-0.67
overy
-0.67
POSITIVE LOGITS
lihood
1.87
lier
1.34
liest
1.33
liness
0.98
minded
0.97
minded
0.94
ours
0.93
wildfire
0.85
clock
0.79
yours
0.79
Activations Density 2.327%