INDEX
Explanations
instances of the word 'first'
instances of the word "first" related to achievements or milestones
New Auto-Interp
Negative Logits
bara
-0.87
skirts
-0.75
etics
-0.73
Vish
-0.69
mund
-0.65
plex
-0.65
urat
-0.65
Mub
-0.65
park
-0.62
witz
-0.62
POSITIVE LOGITS
foray
1.26
outing
1.09
appearance
0.92
playthrough
0.88
cousin
0.86
glimpse
0.86
birthday
0.84
attempt
0.83
sip
0.80
incarnation
0.80
Activations Density 0.095%