INDEX
Explanations
phrases related to negative or derogatory descriptions
references to holes, particularly in a metaphorical or humorous context
New Auto-Interp
Negative Logits
ency
-0.83
ERG
-0.78
ables
-0.70
ione
-0.69
lux
-0.69
Joy
-0.68
ICT
-0.67
REE
-0.66
Carbuncle
-0.66
RON
-0.65
POSITIVE LOGITS
hole
1.56
holes
1.29
Hole
0.94
hole
0.91
ocene
0.88
shit
0.84
holes
0.81
hog
0.79
izons
0.75
Cerberus
0.73
Activations Density 0.014%