INDEX
Explanations
words related to alternatives or options
references to alternative concepts or viewpoints
New Auto-Interp
Negative Logits
hips
-0.89
older
-0.86
haw
-0.85
Import
-0.73
artney
-0.71
girls
-0.71
awar
-0.69
encers
-0.68
Chicken
-0.68
ahon
-0.68
POSITIVE LOGITS
alternative
1.08
Altern
1.05
alternatives
1.05
atives
0.95
solutions
0.91
options
0.87
altern
0.83
Alternative
0.82
viewpoints
0.80
explanations
0.79
Activations Density 0.013%