INDEX
Explanations
high-frequency, single tokens that contribute to the overall sentiment or tone of the document
New Auto-Interp
Negative Logits
للمعارف
-0.67
فريبيس
-0.60
ItemBackground
-0.56
ecture
-0.54
IRQn
-0.53
xase
-0.51
EndInit
-0.51
ژاد
-0.50
rhosis
-0.50
Fazit
-0.50
POSITIVE LOGITS
seven
0.82
eight
0.80
nine
0.78
six
0.77
thousands
0.75
thirteen
0.74
five
0.74
sixteen
0.73
least
0.72
thirty
0.72
Activations Density 0.205%