Explanations
rules and regulations or restrictions
rules and regulations
Negative Logits
figure       -0.71
mirac        -0.69
miah         -0.67
CrossRef     -0.66
acerb        -0.65
framework    -0.63
Lyndon       -0.63
prototype    -0.62
effic        -0.62
ortun        -0.61
Positive Logits
prohibited    1.07
nudity        1.02
exceptions    0.97
allowable     0.97
permitted     0.96
prohibits     0.93
forbidden     0.92
bidden        0.92
forbids       0.91
iquette       0.91
Activations Density: 0.905%