INDEX
Explanations
the verb "be" in various forms and contexts
New Auto-Interp
Negative Logits
ongyang
-0.72
rigs
-0.69
plings
-0.65
ortmund
-0.64
strain
-0.64
aceutical
-0.62
rones
-0.62
strous
-0.62
dispute
-0.62
Bever
-0.61
POSITIVE LOGITS
able
1.22
reunited
0.91
reminded
0.86
reborn
0.83
surrounded
0.83
challenged
0.83
treated
0.82
bitten
0.82
seated
0.82
aware
0.81
Activations Density 0.040%