INDEX
Explanations
quotes and direct speech
words or phrases that indicate reported speech or quotations
New Auto-Interp
Negative Logits
¥µ
-0.76
illance
-0.75
cci
-0.73
iamond
-0.72
BAT
-0.72
pouch
-0.70
beads
-0.69
ara
-0.69
£ı
-0.69
ecast
-0.69
POSITIVE LOGITS
Fre
2.48
Fre
2.46
Freeman
2.02
fre
1.84
fre
1.59
FRE
1.52
Freak
1.50
Gre
1.40
Frey
1.22
Gre
1.14
Activations Density 0.231%