INDEX
Explanations
Italian names, specifically "Giuseppe" and "Carlo"
specific names and terms related to indoctrination and individuals involved in the field
Negative Logits
hips      -1.00
Observer  -0.82
glim      -0.79
irlf      -0.76
neys      -0.74
alid      -0.70
umin      -0.70
Dash      -0.69
rity      -0.69
ging      -0.68
Positive Logits
forth     1.02
xual      0.93
Marino    0.87
Äį        0.87
ppe       0.86
zzo       0.85
oples     0.82
ctr       0.81
zzi       0.80
pper      0.78
Activations Density 0.026%
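
The density figure above is usually read as the share of token positions on which the feature fires at all. The sketch below shows that calculation under that assumption; the function name `activation_density`, the `threshold` parameter, and the toy data are hypothetical and do not come from this page or from any particular tool.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions with a non-zero
    (here: above-threshold) activation. The page itself does not define it.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 10,000 token positions, 3 of which fire.
toy = [0.0] * 9997 + [1.2, 0.4, 2.7]
density = activation_density(toy)

# Format as a percentage, matching the style of the figure on the page.
print(f"{density:.3%}")
```

On this reading, a density of 0.026% corresponds to the feature firing on roughly 26 out of every 100,000 token positions.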