INDEX
Explanations
instances where someone is searching or seeking something
instances of the phrase "looking for."
New Auto-Interp
Negative Logits
Own
-0.63
MQ
-0.58
dt
-0.58
IL
-0.57
Lay
-0.56
WN
-0.56
paralle
-0.55
accompan
-0.55
©
-0.54
ACC
-0.54
POSITIVE LOGITS
forward
1.13
for
0.97
Forward
0.85
forwards
0.82
towards
0.79
forward
0.78
toward
0.77
oward
0.74
backward
0.73
desperately
0.72
Activations Density 0.050%