INDEX
Explanations
factual statements and key issues related to decision-making and clarification
New Auto-Interp
Negative Logits
totalité
-0.61
stør
-0.57
exact
-0.56
Климат
-0.55
entire
-0.54
EXACT
-0.54
segala
-0.53
annica
-0.53
Entire
-0.51
piedi
-0.50
POSITIVE LOGITS
SOME
0.85
some
0.84
some
0.81
PerformLayout
0.75
Some
0.74
Some
0.73
nakalista
0.72
المعيارى
0.72
SOME
0.71
فريبيس
0.71
Activations Density 0.264%