INDEX
Explanations
indications of time and frequency related to actions or events
New Auto-Interp
Negative Logits
unsch
-0.17
lug
-0.14
icut
-0.13
723
-0.13
à¸łà¸²à¸©
-0.12
Bearer
-0.12
LIABLE
-0.12
APPER
-0.12
аÑĢÑħ
-0.12
abei
-0.12
POSITIVE LOGITS
allow
1.06
Allow
0.98
allow
0.98
Allow
0.94
allowing
0.93
allows
0.93
åħģ
0.87
ALLOW
0.85
allowed
0.82
permit
0.82
Activations Density 0.627%