INDEX
Explanations
instances of action words
instances of the phrase "to do" indicating actions or tasks
New Auto-Interp
Negative Logits
lights
-0.80
mare
-0.71
ussen
-0.67
pa
-0.65
wagen
-0.64
Reviewer
-0.62
tight
-0.62
Handling
-0.62
ware
-0.61
bane
-0.59
POSITIVE LOGITS
omsday
0.99
pez
0.98
omething
0.91
ppel
0.85
lez
0.78
ggy
0.78
oms
0.77
ozy
0.77
lyak
0.76
something
0.75
Activations Density 0.102%