Explanations
instances where the phrase "the first time" is used
Negative Logits
ouk           -0.74
ement         -0.73
rake          -0.72
haar          -0.69
inders        -0.68
usters        -0.68
matter        -0.68
ead           -0.67
inki          -0.67
ona           -0.65
Positive Logits
someone       0.80
ndra          0.73
somebody      0.73
anyone        0.72
offenders     0.71
they          0.70
Canadians     0.68
ever          0.67
encountering  0.66
eve           0.66
Activations Density: 0.082%