INDEX
Explanations
words related to rules and regulations
references to specific regulations or rules, particularly "Rule 9" and "Rule 8."
Negative Logits
Token       Logit
itate       -0.86
acters      -0.82
ité         -0.82
Pradesh     -0.75
Hots        -0.73
itant       -0.72
apest       -0.69
izoph       -0.68
imar        -0.68
velength    -0.68
Positive Logits
Token       Logit
book        1.28
books       1.14
making      1.00
breakers    0.95
breaker     0.94
maker       0.93
makers      0.90
witz        0.85
breaker     0.83
Rule        0.80
Activations Density: 0.025%