INDEX
Explanations
repeated occurrences of the word "same"
expressions of similarity or sameness
New Auto-Interp
Negative Logits
*=-
-0.72
omics
-0.69
Provided
-0.64
orsi
-0.64
akings
-0.61
åĪ
-0.60
ommod
-0.60
arest
-0.60
OST
-0.59
erest
-0.59
POSITIVE LOGITS
thing
0.96
exact
0.91
vein
0.88
amount
0.81
rity
0.74
sorts
0.71
kind
0.70
day
0.70
kinds
0.67
same
0.67
Activations Density 0.045%