INDEX
Explanations
verbs indicating actions being done
repetitive usages of the verb forms "do" and "did" in relation to actions performed
New Auto-Interp
Negative Logits
furt
-0.68
Board
-0.64
CLUD
-0.61
Entered
-0.59
inently
-0.59
ricted
-0.58
ussen
-0.57
Det
-0.56
hner
-0.56
RAFT
-0.55
POSITIVE LOGITS
pez
1.15
differently
1.02
wrong
0.94
administr
0.91
wrong
0.81
actic
0.76
succeed
0.73
Äĩ
0.73
hing
0.72
onga
0.72
Activations Density 0.061%