INDEX
Explanations
occurrences of the word "had"
New Auto-Interp
Negative Logits
woordig
-0.75
pist
-0.70
Sammy
-0.69
orion
-0.67
SLS
-0.66
Olsson
-0.65
olor
-0.64
Torrey
-0.64
Reuter
-0.64
PreferredItem
-0.64
POSITIVE LOGITS
hoped
1.16
earlier
1.05
HAD
1.02
Earlier
0.99
had
0.97
Had
0.97
Earlier
0.92
hadn
0.90
Had
0.90
originalmente
0.87
Activations Density 0.153%