INDEX
Explanations
references to fictional narratives or literary themes
Negative Logits
ftagPool          -0.51
للاسماء           -0.51
//                -0.50
Roskov            -0.45
drawal            -0.45
kaarangay         -0.45
EndInit           -0.44
GenerationType    -0.44
bedienen          -0.44
onCreateView      -0.42
Positive Logits
mention           1.54
comment           1.53
comments          1.49
mentioning        1.47
explanation       1.47
mentions          1.42
explain           1.38
explaining        1.38
saying            1.38
description       1.36
Activations Density: 1.649%