INDEX
Explanations
instances of the word 'first' followed by a number
occurrences of the word "first."
New Auto-Interp
Negative Logits
orate
-0.80
eez
-0.63
hin
-0.61
endif
-0.59
soType
-0.58
illes
-0.58
impunity
-0.57
SPONSORED
-0.56
å§«
-0.55
rah
-0.55
POSITIVE LOGITS
first
3.13
first
2.51
FIRST
2.13
First
2.01
First
1.82
second
1.68
earliest
1.66
initial
1.55
fourth
1.39
third
1.36
Activations Density 0.095%