INDEX
Explanations
terms related to physical fitness or health
repetitive phrases or patterns in text
New Auto-Interp
Negative Logits
obook
-0.69
Introduced
-0.67
Advertisement
-0.63
abo
-0.63
,[
-0.62
ajor
-0.62
bryce
-0.62
agram
-0.60
âĢķ
-0.60
Adult
-0.59
POSITIVE LOGITS
huh
1.42
eh
1.21
please
0.98
Pt
0.86
sir
0.76
yes
0.76
yeah
0.74
etc
0.71
beware
0.68
meanwhile
0.68
Activations Density 0.341%