INDEX
Explanations
demonstrative terms followed by information or facts
the phrase "Here" followed by additional information or context
New Auto-Interp
Negative Logits
omedical
-0.65
omore
-0.63
Zen
-0.62
grace
-0.60
Shape
-0.58
hygiene
-0.57
offensive
-0.55
LSD
-0.55
ped
-0.55
ces
-0.54
POSITIVE LOGITS
tics
1.07
here
1.03
abouts
0.97
tical
0.94
tic
0.94
Transcript
0.89
newsp
0.89
ford
0.78
Comes
0.78
Here
0.73
Activations Density 0.014%