INDEX
Explanations
words related to situations or events that emphasize a sense of continuation or persistence
references to evaluations or situations
New Auto-Interp
Negative Logits
nob
-0.64
Wise
-0.62
expert
-0.61
Cary
-0.61
kn
-0.58
Podesta
-0.58
magn
-0.57
counsel
-0.56
Chain
-0.56
Hub
-0.56
POSITIVE LOGITS
uation
4.44
uations
3.23
uated
3.10
uating
3.04
uates
2.83
uate
2.68
uity
1.61
uing
1.60
uality
1.47
uum
1.31
Activations Density 0.012%