INDEX
Explanations
references to electrical plugs or plug-related actions
references to "plug-in" technology or devices
Negative Logits
Token       Logit
quez        -0.72
birth       -0.71
perse       -0.68
IAS         -0.66
Butterfly   -0.65
surv        -0.63
vict        -0.62
swer        -0.62
Lann        -0.61
ACTIONS     -0.61
Positive Logits
Token       Logit
ging        1.42
ged         1.33
glers       1.22
plug        0.98
plug        0.95
atis        0.88
plugs       0.87
lessly      0.85
Plug        0.83
gers        0.81
Activations Density: 0.013%