INDEX
Explanations
instances of the word 'fair' used in a context related to justice, equality, or fairness
references to fairness and equitable treatment
Negative Logits
apse          -0.82
CHAT          -0.79
uality        -0.75
Ultra         -0.66
clips         -0.65
Reincarnated  -0.63
hent          -0.63
cano          -0.63
OUS           -0.61
Extra         -0.60
Positive Logits
grounds       1.26
yt            1.16
ground        0.91
fair          0.87
iciary        0.85
fair          0.84
child         0.79
abouts        0.78
fare          0.75
ies           0.74
Activations Density 0.017%