INDEX
Explanations
product or brand names in capital letters
abbreviations and acronyms related to a particular domain or context
New Auto-Interp
Negative Logits
Paul
-0.64
Aman
-0.64
agra
-0.63
mary
-0.63
beg
-0.62
Abraham
-0.62
ammon
-0.61
Jama
-0.61
aisle
-0.61
elect
-0.61
POSITIVE LOGITS
TL
4.61
tl
2.34
TL
2.18
TN
1.67
TF
1.27
TP
1.25
TG
1.23
TD
1.19
TPS
1.18
KT
1.17
Activations Density 0.010%