INDEX
Explanations
references to the word "Urbana"
repeated mentions of the term "Ur."
New Auto-Interp
Negative Logits
女
-0.80
Scarlet
-0.79
washer
-0.73
Sins
-0.72
atform
-0.71
Dickinson
-0.67
Clarkson
-0.64
eting
-0.64
brackets
-0.62
NetMessage
-0.62
POSITIVE LOGITS
gent
1.22
seless
1.05
gery
1.05
gencies
1.03
gency
1.01
du
0.99
gently
0.98
pee
0.95
inary
0.93
bers
0.92
Activations Density 0.025%