INDEX
Explanations
proper nouns
references to individuals or titles associated with "Paolo."
New Auto-Interp
Negative Logits
eering
-0.83
eer
-0.72
scape
-0.70
bol
-0.67
enegger
-0.66
piece
-0.64
cape
-0.64
Lag
-0.63
bolt
-0.63
naire
-0.61
POSITIVE LOGITS
ignt
0.90
USE
0.90
eeper
0.86
olis
0.85
IRED
0.85
olo
0.85
ublic
0.81
arser
0.81
izo
0.80
UGE
0.79
Activations Density 0.052%