INDEX
Explanations
passages related to political events and news
New Auto-Interp
Negative Logits
ibrary
-0.33
DonaldTrump
-0.31
Bulgar
-0.28
uchin
-0.28
Aval
-0.27
xia
-0.27
ItemTracker
-0.27
TODAY
-0.26
ÃŁ
-0.26
Hung
-0.26
POSITIVE LOGITS
alike
0.71
thereof
0.67
accordingly
0.63
respectively
0.62
thereafter
0.59
thereto
0.53
therein
0.52
versa
0.48
attRot
0.41
afterward
0.40
Activations Density 46.970%