INDEX
Explanations
instances of the word "have" and its variations
New Auto-Interp
Negative Logits
pmatrix
-0.64
Coats
-0.61
Cuen
-0.61
tuyến
-0.60
addComponent
-0.60
nitus
-0.59
path
-0.58
path
-0.58
Credito
-0.57
strich
-0.57
POSITIVE LOGITS
having
0.85
having
0.82
Having
0.82
HAVE
0.79
Having
0.77
Had
0.74
HAVING
0.72
Have
0.70
bianche
0.69
Had
0.69
Activations Density 0.232%