INDEX
Explanations
phrases with the word "those" followed by a number
the phrase "for those," indicating content that addresses a specific audience or group
New Auto-Interp
Negative Logits
ob
-0.73
maker
-0.69
forth
-0.68
ILY
-0.68
enegger
-0.67
atform
-0.65
onis
-0.65
kamp
-0.64
Resolution
-0.64
Cheong
-0.64
POSITIVE LOGITS
wishing
1.03
interested
0.97
pesky
0.92
redes
0.90
unfamiliar
0.88
wanting
0.88
curious
0.85
unlucky
0.81
purposes
0.79
kinds
0.79
Activations Density 0.047%