INDEX
Explanations
the word "demonstrate" and its variations
demonstrate that or the
New Auto-Interp
Negative Logits
wiſe
-0.65
Houſe
-0.65
leſs
-0.61
Portail
-0.60
unzel
-0.59
septic
-0.58
wrist
-0.57
ticides
-0.56
ingual
-0.55
Neck
-0.55
POSITIVE LOGITS
demonstration
1.60
demonstrate
1.52
demonstrated
1.48
demonstrating
1.44
Demonstration
1.39
Demonstr
1.38
demon
1.36
Demonstrate
1.34
demonstrates
1.34
demonstrations
1.33
Activations Density 0.016%