INDEX
Explanations
the word "Free" followed by a variety of different terms
references to the term "Free."
New Auto-Interp
Negative Logits
acea
-0.68
IOR
-0.63
bore
-0.62
stroke
-0.62
essa
-0.61
aggrav
-0.60
isks
-0.60
TIT
-0.59
leth
-0.58
ihu
-0.57
POSITIVE LOGITS
Free
3.89
Free
2.84
free
2.29
free
2.25
FREE
2.15
FREE
1.91
Freedom
1.26
Freed
1.12
Freedom
1.11
Freeze
1.11
Activations Density 0.015%