INDEX
Explanations
references to invisibility, both literal and metaphorical
references to the concept of invisibility
New Auto-Interp
Negative Logits
andals
-0.89
ortment
-0.82
ership
-0.81
eda
-0.77
olitan
-0.76
ulet
-0.76
âķIJâķIJ
-0.76
aeper
-0.75
YC
-0.74
ocations
-0.74
POSITIVE LOGITS
invisible
0.88
invis
0.85
worm
0.83
immune
0.78
phantom
0.77
Invisible
0.77
glove
0.71
worms
0.70
Coffin
0.69
Lap
0.69
Activations Density 0.037%