Explanations
specific nouns and actions related to characters and their interactions in a narrative context
Negative Logits
Token          Logit
raisonnable    -0.68
pouvoit        -0.66
comprends      -0.60
ktop           -0.60
lanze          -0.60
préfé          -0.60
adeloupe       -0.59
GOTREF         -0.59
auroit         -0.58
delige         -0.58
Positive Logits
Token          Logit
query          0.52
whose          0.49
another        0.48
recently       0.46
RSSSF          0.45
Lordships      0.45
RenderAtEndOf  0.45
tito           0.44
recientemente  0.44
claimed        0.43
Activations Density: 0.552%