INDEX
Explanations
phrases related to events or places with the term "Faire" in them
references to the concept of 'fair'
New Auto-Interp
Negative Logits
odd
-0.67
ologically
-0.67
insula
-0.66
arresting
-0.65
igating
-0.65
umm
-0.63
uzzle
-0.63
ulates
-0.62
igsaw
-0.61
ushed
-0.60
POSITIVE LOGITS
aire
1.27
Lys
0.80
Francois
0.79
neau
0.79
nat
0.79
eers
0.75
aires
0.74
François
0.73
Leban
0.73
ppe
0.73
Activations Density 0.007%