Explanations
mentions of the word "first" and its variations
Negative Logits
Token        Logit
méri         -0.94
Theſe        -0.89
Magdalene    -0.88
decorada     -0.88
ApJ          -0.87
Schemes      -0.85
Scrolls      -0.85
équilibr     -0.85
LEncoder     -0.85
Tales        -0.84
Positive Logits
Token        Logit
First        2.01
FIRST        1.92
first        1.90
First        1.89
FIRST        1.86
first        1.72
first        1.40
getFirst     1.26
ersten       1.25
rst          1.25
Activations Density: 0.131%