INDEX
Explanations
statements starting with "If that", implying a condition or scenario
phrases and clauses that suggest conditional statements or hypotheticals
Negative Logits
Token           Logit
ILCS            -0.67
indal           -0.61
ngth            -0.60
iband           -0.60
IOR             -0.59
Effective       -0.58
interstitial    -0.58
igmatic         -0.57
©¶æ¥µ           -0.57
HUD             -0.56
Positive Logits
Token           Logit
weren           1.54
were            1.16
were            1.10
hadn            1.09
happens         1.02
bothers         0.99
pans            0.98
fails           0.98
succeeds        0.98
ain             0.94
Activations Density 0.173%
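
The page does not say how the density figure is computed. A minimal sketch, assuming it is the fraction of sampled token positions on which the feature activation is above zero; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only for illustration:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature fires at all. The source page does not confirm this definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 173 of 100,000 token positions.
sample = [1.0] * 173 + [0.0] * (100_000 - 173)
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.173% would mean the feature fires on roughly 1 in every 580 tokens.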