LaTeX splits URL in bibliography and the web page cannot be found

I get a wrong URL when I click the link in the bibliography of a PDF generated with LaTeX. I tried both the url and xurl packages, but the error persists.
Here is the code:
@misc{understanding_encoder_resolution,
title={Understanding Encoder Resolution and Its 3 Forms},
author={US Digital},
howpublished={https://www.usdigital.com/blog/understanding-encoder-resolution-and-its-3-forms/},
year={2019},
note = {Accedido: 5-10-2022},
}
And here is the output:
This is the wrong URL I get when I click the hyperref:
I hope you can help me.
Thanks in advance.

Related

Can we use a bib file from the web for pdf_book in bookdown?

In R Markdown, we can generate a PDF file when the bib file is loaded from a URL.
For example, we set
bibliography: [https://raw.githubusercontent.com/ChoCho66/test/main/text.bib]
In bookdown, we can also generate an HTML book (gitbook, bs4_book) when the bib file is loaded from a URL.
However, pdf_book does not work and fails with the following message:
! Undefined control sequence.
\hyper@normalise ...M{ }\catcode `\%\active \let %
\@percentchar \let \%\@per...
l.430 ...content.com/ChoCho66/test/main/text.bib}}
The generated .tex file is wrong because it contains the following line:
\bibliography{\url{https://raw.githubusercontent.com/ChoCho66/test/main/text.bib}}
Is there any way to solve this problem?
I want one configuration that compiles both the HTML book and the PDF book when the bib file is given as a URL.

IMPORTXML of a URL containing Hebrew returns an encoding other than UTF-8 that Chrome doesn't recognize

For example, in the dummy spreadsheet (tab 'desired outcome'), under "Link 1" you will see this URL:
http://www.promotion-il.co.il/service/%5DE%5E4%5D9%5E5-%5E8%5D9%5D7-%5D7%5E9%5DE%5DC%5D9-%5DC%5E2%5E1%5E7%5D9%5DD/
However, the actual URL in UTF-8 is:
http://www.promotion-il.co.il/service/%D7%9E%D7%A4%D7%99%D7%A5-%D7%A8%D7%99%D7%97-%D7%97%D7%A9%D7%9E%D7%9C%D7%99-%D7%9C%D7%A2%D7%A1%D7%A7%D7%99%D7%9D/
The actual URL string that contains Hebrew is:
http://www.promotion-il.co.il/service/מפיץ-ריח-חשמלי-לעסקים/
I will also add that the same URL has returned with a proper UTF-8 encoding for other blog posts. (See second example in the same tab).
Why is it happening?
How can it be fixed?
Thanks in advance!
This is the solution I eventually came up with:
I saw that, for the imported URLs, two replacements were needed to fix a broken URL:
5D --> D7%9
5E --> D7%A
I used this formula in a separate column to achieve it:
=ARRAYFORMULA(SUBSTITUTE(SUBSTITUTE((<COLUMN WITH IMPORTED URLS HERE>),"5D","D7%9"),"5E","D7%A"))
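For checking the two substitutions outside the spreadsheet, here is a minimal Python sketch of the same idea. The function name repair_hebrew_url is hypothetical, and the sketch assumes, as the formula does, that the strings "5D" and "5E" occur only inside the broken escapes; a URL that contains them elsewhere would be altered too.

```python
from urllib.parse import unquote


def repair_hebrew_url(url: str) -> str:
    """Apply the two substitutions from the answer: 5D -> D7%9 and 5E -> D7%A.

    Hypothetical helper; assumes "5D" and "5E" appear only in broken escapes.
    """
    return url.replace("5D", "D7%9").replace("5E", "D7%A")


# The broken URL from the question.
broken = (
    "http://www.promotion-il.co.il/service/"
    "%5DE%5E4%5D9%5E5-%5E8%5D9%5D7-%5D7%5E9%5DE%5DC%5D9-%5DC%5E2%5E1%5E7%5D9%5DD/"
)

fixed = repair_hebrew_url(broken)
print(fixed)           # percent-encoded UTF-8 form
print(unquote(fixed))  # decoded form containing the Hebrew text
```

Decoding the repaired URL with unquote is a quick way to confirm that the result is valid UTF-8 and matches the expected Hebrew path.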

Chinese fonts in an English LaTeX document

How can I write some Chinese characters (simplified) within an English LaTeX document? I followed some methods suggested online, but none of them work for me. I am using TeXworks to compile my document (pdfLaTeX).
\documentclass{article}
\usepackage{CJKutf8}
\begin{document}
\begin{CJK}{UTF8}{bsmi}
你好吗
\end{CJK}
\end{document}
Another method is mentioned here:
\usepackage{CJKutf8}
\newcommand{\zh}[1]{\begin{CJK}{UTF8}{gbsn}#1\end{CJK}}
Then insert whatever Chinese characters you like into the document by typing
\zh{中文}
When I use the CJKutf8 method with pdfLaTeX mentioned here, I receive the following error:
The log file hopefully contains the information to get MiKTeX going again:
C:\Users\muhd_\AppData\Local\MiKTeX\2.9\miktex\log\miktex-makepk.log
)
!pdfTeX error: miktex-pdftex.exe (file bsmiu30): Font bsmiu30 at 600 not found
==> Fatal error occurred, no output PDF file produced!
Log file is attached here.
The first error in the log file
The process terminated due to an access violation.
suggests that there is a permissions problem when MiKTeX tries to create the files needed to use the font for the Chinese characters.
Based on this answer https://tex.stackexchange.com/a/419809/36296, the problem can be fixed either by running updmap from the command line or by choosing "Refresh font map files" in the MiKTeX Console (in user mode).

Wrong character encoding when using PDFBox to extract text from a PDF

I need to index PDFs into Elasticsearch and am using PDFBox to extract the text from them. However, the extracted text has the wrong character encoding, like this:
Ýëĭ2ĈjŬj§ė¥
1 ŋ?nij"2$ 2016£ 2Ú 5Õ,”Òªj§?ně#ij"2ě
^ë2ļŘœ A$j§?n 2016£ě#ëÖĭ2Ĉļê
2 èÅŋ?n$ 2016£ 2Ú 6ÕöĿS¿ ĿS¿ ĿS
Õ¿ ĿSÖ¿ eöĿS&غĨĘ
http://www.sse.com.cnLćĈ
A$j§Ýëĭ2ĈŘĐ
My code is exactly the same as shown on this page. I tried PDFBox versions from 0.8.x to 2.0.x, but it still does not work.
Any help or advice would be appreciated!
I got the answer from @Tilman's comment.
See pdfbox.apache.org/1.8/faq.html#notext and the answer below too.

LaTeX compilation doesn't generate output

I can't generate the output of my LaTeX document (using TeXnicCenter). At the end of the log it says:
Latex-Result: 0 Error(s), 0 Warning(s), 0 Bad Box(es), 0 Page(s)
and correspondingly the viewer says it couldn't find the file.
I have defined the path of the pdf viewer but it didn't help. The path of the source document doesn't contain spaces. Does anyone know how to fix this problem?
Thanks in advance!
Thank you very much, Steve, but I've just found the problem: I had to change the "Install packages on-the-fly"-setting to "yes" instead of "ask me first".
In my case, this error appeared after I uploaded the project as a zip file.
The fix was to create a blank project first and then upload the contents of the zip file.
