INDEX
Explanations
instances of the word "let" followed by a high activation value of 9 or 10
instances of the word "let."
New Auto-Interp
Negative Logits
Heroic
-0.64
results
-0.61
seats
-0.61
Dark
-0.61
proprietary
-0.58
dead
-0.58
students
-0.57
offices
-0.55
advisors
-0.55
Universe
-0.55
POSITIVE LOGITS
let
4.61
lets
3.75
LET
2.78
letes
1.95
lete
1.93
lette
1.84
lett
1.78
leted
1.49
lest
1.45
leton
1.34
Activations Density 0.020%