Explanations
French names or words
Specific character sequences or key strings in the text
Negative Logits
yip        -0.68
jriwal     -0.63
merit      -0.57
boo        -0.56
Wanted     -0.56
mates      -0.55
oteric     -0.55
mood       -0.54
minster    -0.54
mate       -0.53
Positive Logits
î          0.91
issance    0.82
quin       0.79
uner       0.79
ère        0.77
eger       0.77
ean        0.74
ilton      0.74
ires       0.73
irs        0.73
Activations Density: 0.148%
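
To make the density figure concrete, here is a minimal sketch of how such a number could be computed. It assumes, and this is not stated in the source, that density means the fraction of token positions where the feature's activation is above zero. The function name activation_density, the threshold parameter, and the sample data are hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of token positions where the
    feature fires at all; the source does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 148 of 100,000 token positions.
sample = [1.0] * 148 + [0.0] * (100_000 - 148)
density = activation_density(sample)
print(f"{density:.3%}")
```

Under that reading, a density of 0.148% would correspond to the feature firing on roughly 148 of every 100,000 tokens, which fits a sparse feature that responds to a narrow class of strings.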