INDEX
Explanations
phrases or sentences that are used to verify that a human is interacting with a system and is not a robot
references to robots or robotic concepts
New Auto-Interp
Negative Logits
ources
-0.67
Oaks
-0.66
ciples
-0.64
Tong
-0.64
Emin
-0.63
drops
-0.63
avenues
-0.62
ciation
-0.62
monds
-0.61
Hudson
-0.61
POSITIVE LOGITS
ichick
0.76
anymore
0.75
obic
0.73
reliant
0.71
hered
0.69
ecast
0.68
proof
0.68
dayName
0.68
iframe
0.68
escent
0.66
Activations Density 0.020%