INDEX
Explanations
phrases related to changes or trends over time
references to recent events or trends
New Auto-Interp
Negative Logits
0000000000000000
-0.70
çīĪ
-0.69
NPR
-0.66
trap
-0.66
£ı
-0.65
shalt
-0.65
death
-0.64
STAR
-0.62
claimer
-0.62
istani
-0.62
POSITIVE LOGITS
tandem
1.31
popularity
1.17
response
1.15
versely
1.13
unison
1.08
importance
1.07
leaps
1.04
favor
1.02
prominence
1.01
relation
1.01
Activations Density 0.125%