INDEX
Explanations
phrases that emphasize the significance of specific nouns or concepts
"[the] [noun]" pattern
specific concepts or items
New Auto-Interp
Negative Logits
راد
-0.51
ComVisible
-0.47
Vinc
-0.45
Chwiliwch
-0.44
AlterField
-0.43
hesitating
-0.43
lani
-0.43
IBAction
-0.41
autorytatywna
-0.41
copo
-0.41
POSITIVE LOGITS
phenomenon
0.93
genre
0.92
fenomeno
0.80
phénomène
0.78
technique
0.71
modality
0.70
phenomena
0.70
malady
0.67
fenómeno
0.66
artige
0.65
Activations Density 0.412%