INDEX
Explanations
mentions of wire or wire-related terms
references to wire fraud and related terms
Negative Logits
ivia          -0.84
contestant    -0.67
eston         -0.66
contestants   -0.65
BIP           -0.65
foss          -0.64
olute         -0.62
uum           -0.62
zzy           -0.61
ļé            -0.61
Positive Logits
lessly        1.21
tap           1.15
frame         1.07
fram          1.00
frames        0.97
mesh          0.94
wire          0.90
haired        0.87
fences        0.84
lessness      0.81
Activations Density: 0.020%