INDEX
Explanations
phrases related to cause and effect
instances of the word "effect" and its variations
New Auto-Interp
Negative Logits
pigeon
-0.75
dig
-0.68
Methodist
-0.62
croft
-0.60
apest
-0.59
zar
-0.59
Timber
-0.58
Dud
-0.58
Alto
-0.57
Compet
-0.56
POSITIVE LOGITS
iveness
1.43
ual
1.17
ively
1.15
uated
1.15
uating
1.07
uation
1.04
uates
0.95
ually
0.95
ivation
0.90
uate
0.89
Activations Density 0.048%