INDEX
Explanations
phrases related to specific names or entities
occurrences of a specific character or symbol in the text
New Auto-Interp
Negative Logits
flowering
-0.79
disadvant
-0.78
fading
-0.64
complement
-0.64
adolesc
-0.63
disabling
-0.63
Anglo
-0.62
fodder
-0.62
CLR
-0.62
democracy
-0.62
POSITIVE LOGITS
ï¸ı
1.10
s
1.06
said
0.94
saw
0.94
has
0.93
sure
0.92
ccording
0.92
had
0.91
mad
0.89
mental
0.88
Activations Density 0.271%