INDEX
Explanations
phrases introducing a quote
instances of quoted speech or statements made by individuals
New Auto-Interp
Negative Logits
hawks
-0.66
boosters
-0.63
delinquent
-0.61
bilt
-0.60
stabilization
-0.56
happ
-0.55
recomm
-0.54
ebin
-0.54
halla
-0.53
aples
-0.53
POSITIVE LOGITS
'[
1.19
"[
1.17
"'
1.16
""
1.10
"(
1.10
"
1.08
''
1.08
'
1.04
"...
1.02
"@
0.99
Activations Density 0.059%