INDEX
Explanations
phrases related to expertise and experience
Negative Logits
RectangleBorder  -0.70
(no token shown)  -0.64
Worse  -0.55
alibi  -0.55
ⓘ  -0.55
ftagPool  -0.53
bienven  -0.53
supposed  -0.53
دیگر  -0.53
noce  -0.51
Positive Logits
offices  0.67
extensive  0.65
offices  0.57
numerous  0.57
širo  0.54
ReusableCell  0.53
Билгалдахарш  0.53
many  0.52
vast  0.52
extensive  0.52
Activations Density: 0.142%