Explanations
verbs related to action or change
Negative Logits
Archdemon    -0.62
velt         -0.61
annex        -0.58
experiment   -0.57
Britann      -0.57
endeavour    -0.56
iege         -0.56
cardinal     -0.54
endeavor     -0.54
Principle    -0.54
Positive Logits
rid                     1.46
tin                     1.02
cloneembedreportprint   0.98
acquainted              0.97
distracted              0.91
TING                    0.89
sucked                  0.88
bored                   0.87
bent                    0.84
underway                0.83
Activations Density: 0.462%