INDEX
Explanations
phrases related to asserting or contesting claims
instances of the word "claim."
Negative Logits
arrang         -0.87
srf            -0.86
newcom         -0.76
electing       -0.75
Watching       -0.73
simul          -0.73
actionGroup    -0.72
destro         -0.72
cffffcc        -0.67
odes           -0.66
Positive Logits
claims         0.93
ylum           0.86
Claim          0.85
ifications     0.82
orial          0.79
claim          0.78
oux            0.77
ages           0.77
ulent          0.76
Cheong         0.75
Activations Density 0.024%