INDEX
Explanations
instances of the word "right" in various contexts
New Auto-Interp
Negative Logits
indsight
-0.17
fairness
-0.14
YRO
-0.14
thorough
-0.14
asier
-0.14
alt
-0.14
treff
-0.14
elize
-0.13
convenience
-0.13
good
-0.13
POSITIVE LOGITS
amount
0.29
kind
0.24
kinds
0.22
amount
0.21
iele
0.21
combination
0.20
-sized
0.20
KIND
0.20
ilk
0.19
est
0.18
Activations Density 0.038%