INDEX
Explanations
mentions of the name "Nicola."
New Auto-Interp
Negative Logits
ifiers
-0.86
eous
-0.76
bour
-0.73
jug
-0.68
ifier
-0.64
seams
-0.62
ors
-0.62
mund
-0.59
rupulous
-0.59
urable
-0.59
POSITIVE LOGITS
llan
0.77
actly
0.76
oche
0.74
ensen
0.72
kell
0.72
uador
0.70
keye
0.69
chip
0.68
TIME
0.66
pez
0.64
Activations Density 0.040%