INDEX
Explanations
the word "same."
references to the concept of sameness
New Auto-Interp
Negative Logits
ases
-0.73
*=-
-0.68
rection
-0.68
orsi
-0.66
rosso
-0.65
åĪ
-0.65
bane
-0.65
gets
-0.63
omics
-0.63
icism
-0.62
POSITIVE LOGITS
thing
0.93
exact
0.88
amount
0.72
ol
0.71
kind
0.70
principle
0.70
sized
0.68
yll
0.68
old
0.67
vein
0.66
Activations Density 0.033%