INDEX
Explanations
terms related to interception, such as intercept, interceptor, and interceptions
mentions of the term "Intercept" related to surveillance or military contexts
New Auto-Interp
Negative Logits
deaf
-0.80
WAYS
-0.65
UGE
-0.65
oke
-0.64
Downloadha
-0.63
creen
-0.63
HAHA
-0.61
\\\\
-0.61
cker
-0.61
HAHAHAHA
-0.59
POSITIVE LOGITS
Intercept
1.08
ors
1.06
abad
0.94
rador
0.93
rained
0.90
rons
0.90
aired
0.86
oway
0.80
ions
0.79
illian
0.76
Activations Density 0.012%