INDEX
Explanations
questions or sentences that are inquiring about specific topics or subjects
questions that begin with "does."
New Auto-Interp
Negative Logits
fights
-0.81
isers
-0.80
agonists
-0.78
isphere
-0.76
runners
-0.76
boats
-0.75
eers
-0.74
offs
-0.73
bags
-0.71
asers
-0.71
POSITIVE LOGITS
olation
0.97
anyone
0.84
anybody
0.82
berra
0.81
olated
0.76
omething
0.74
olate
0.69
nt
0.67
omorphic
0.66
terness
0.65
Activations Density 0.051%