INDEX
Explanations
instances in text that involve performing an action or deed
New Auto-Interp
Negative Logits
case
-0.64
hog
-0.63
)=(
-0.62
Printed
-0.59
fields
-0.59
Entered
-0.58
Hok
-0.57
Madagascar
-0.57
lights
-0.57
Methods
-0.57
POSITIVE LOGITS
omsday
1.11
pez
1.08
ppel
1.06
oms
0.95
herty
0.94
lez
0.92
vet
0.91
ggy
0.90
gging
0.88
xx
0.86
Activations Density 2.397%