INDEX
Explanations
the word "had" in various contexts, indicating a focus on past experiences or completed actions
New Auto-Interp
Negative Logits
woordig
-0.79
hlon
-0.65
knex
-0.61
ಿದೆ
-0.61
ocate
-0.60
perſon
-0.60
Olsson
-0.60
blume
-0.60
いません
-0.59
blumen
-0.58
POSITIVE LOGITS
had
3.57
Had
3.03
had
2.93
Had
2.91
HAD
2.66
HAD
2.11
hadden
2.06
hadde
1.94
hatte
1.93
hatten
1.93
Activations Density 0.091%