INDEX
Explanations
the word "got" at various activation levels
instances of the verb "got" and its variations
New Auto-Interp
Negative Logits
velt
-0.62
vein
-0.59
amd
-0.59
Protect
-0.57
Archdemon
-0.57
trust
-0.57
mare
-0.56
deen
-0.55
die
-0.54
orable
-0.54
POSITIVE LOGITS
rid
1.25
acquainted
1.01
permission
0.89
cloneembedreportprint
0.88
underway
0.88
lucky
0.87
cha
0.80
entangled
0.78
bog
0.77
caught
0.76
Activations Density 0.110%