INDEX
Explanations
verbs and verbal phrases indicating actions or events
Japanese and German verbs
verbs followed by auxiliaries
New Auto-Interp
Negative Logits
ProtoMessage
-1.02
دانشنامهٔ
-0.98
SharedCtor
-0.93
Jefus
-0.92
Houſe
-0.89
Monfieur
-0.89
houſe
-0.88
raiſ
-0.87
Majefty
-0.86
InjectAttribute
-0.85
POSITIVE LOGITS
worth
0.44
rius
0.42
urllib
0.40
allowed
0.40
<!
0.38
or
0.38
sure
0.37
0.37
.”
0.37
'
0.36
Activations Density 0.038%