INDEX
Explanations
references to "ubs" or "UB" within the text
references to various types of "subs" or subcategories in a specific context
New Auto-Interp
Negative Logits
Evening
-0.63
Pent
-0.63
fortune
-0.61
Directions
-0.60
successive
-0.59
ling
-0.59
Paraly
-0.59
Murd
-0.58
Forensic
-0.58
AFP
-0.58
POSITIVE LOGITS
ubs
4.70
ub
2.29
UB
1.93
ubes
1.75
ubby
1.54
ube
1.42
ubi
1.36
ubb
1.33
uber
1.13
unts
1.10
Activations Density 0.013%