INDEX
Explanations
verbs in the infinitive form
Negative Logits
lights     -0.73
sg         -0.62
DOC        -0.61
pac        -0.60
mares      -0.59
ware       -0.58
opol       -0.58
dayName    -0.56
uses       -0.56
Prot       -0.54
Positive Logits
omsday     1.00
pez        0.95
ppel       0.93
omething   0.92
lez        0.88
xx         0.85
ggy        0.84
ozy        0.82
laundry    0.81
oms        0.81
Activations Density: 0.362%