INDEX
Explanations
instances of permissive or relinquishing language
Negative Logits
awtextra          -0.49
HasForeignKey     -0.43
<<<<<<<<<<<<<<    -0.43
endsection        -0.41
onAttach          -0.40
wur               -0.40
scattata          -0.37
исленность        -0.37
gyhoeddwyd        -0.35
dymyr             -0.35
Positive Logits
letting           1.09
Letting           1.05
Letting           1.05
let               0.97
letting           0.95
Allow             0.87
Allowing          0.85
allowing          0.82
allow             0.82
lets              0.77
Activations Density 0.355%