INDEX
Explanations
the term "fan" in different contexts
New Auto-Interp
Negative Logits
===============
-0.73
gaun
-0.65
Nex
-0.63
kj
-0.62
Whittaker
-0.61
///////////////
-0.60
שה
-0.59
fris
-0.59
bé
-0.58
herin
-0.57
POSITIVE LOGITS
fan
1.64
Fan
1.64
fans
1.59
FAN
1.59
Fan
1.55
Fans
1.55
fan
1.53
FAN
1.48
fans
1.46
Fans
1.45
Activations Density 0.013%