INDEX
Explanations
names of specific individuals and locations
themes related to tunneling and surveillance
New Auto-Interp
Negative Logits
Temp
-0.79
IME
-0.75
tem
-0.74
ICLE
-0.72
temp
-0.70
GOODMAN
-0.67
Tem
-0.65
trump
-0.65
Tanz
-0.63
Gim
-0.62
POSITIVE LOGITS
heed
0.90
bed
0.88
pes
0.87
hire
0.85
roxy
0.85
quartered
0.85
ulously
0.85
nel
0.85
geon
0.84
everal
0.84
Activations Density 0.025%