Pandoc Citeproc doesn't work on HTML format - lua

I'm trying to cite entries from a .bib file in an HTML document, but without success. The same setup works perfectly for Markdown, so my question is: does citeproc work on input formats other than Markdown?
Here are some examples which I use:
<!DOCTYPE html>
<html xmlns="http://www.w3.org/1999/xhtml" lang="" xml:lang="en">
<body>
Test [@test1]
</body>
</html>
Command: pandoc --bibliography=test.bib --citeproc test.html -o test.html -s --metadata-file=test.yaml
The .bib file contains the following:
@article{test1,
author = {Rathod, N and Kulawik, P and Ozogul, Y and Ozogul, F and Bekhit, A},
title = {Recent developments in non-thermal processing for seafood and seafood products: cold plasma, pulsed electric field and high hydrostatic pressure},
journal = {International Journal of Food Science & Technology},
date = {2022},
year = {2022},
pages = {774--790},
volume = {57},
number = {2},
doi = {10.1111/ijfs.15392},
raw = {Rathod, N. B., Kulawik, P., Ozogul, Y., Ozogul, F., & Bekhit, A. E. D. A. (2022). Recent
developments in non-thermal processing for seafood and seafood products: cold plasma, pulsed
electric field and high hydrostatic pressure. International Journal of Food Science &
Technology, 57(2), 774-790. https://doi.org/10.1111/ijfs.15392}
}
I have written a Lua filter, but it covers only some of the cases. I'm new to Lua and cannot yet write a filter as complete as the citation handling that exists for Markdown.
Thank you.

Related

Print bibliography: Empty bibliography

I’ve been looking for a working solution for days, but nothing I try works.
The minimal reproducible example is the following:
\documentclass[12pt,a4paper]{article}
\usepackage[utf8]{inputenc}
\usepackage[english]{babel}
\usepackage{csquotes}
\usepackage{amsmath}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage[authordate,autocite=inline,backend=biber,sorting=nyt,]{biblatex-chicago}
\addbibresource{Test.bib}
\usepackage[left=2cm,right=2cm,top=2cm,bottom=2cm]{geometry}
\DeclareUnicodeCharacter{202F}{\,}
\begin{document}
Test citing \citep{sanborn_learning_2014}
\printbibliography
\end{document}
The Test.bib:
@article{sanborn_learning_2014,
title = {Learning Democracy: Education and the Fall of Authoritarian Regimes},
volume = {44},
issn = {0007-1234, 1469-2112},
url = {https://www.cambridge.org/core/product/identifier/S0007123413000082/type/journal_article},
doi = {10.1017/S0007123413000082},
shorttitle = {Learning Democracy},
abstract = {Studies on what causes a state to democratize have focused on economic, social, and international factors. Many of them argue that higher levels of education should promote democracy. However, few articulate clearly how education affects democratization, and fewer still attempt to test the supposed link across time and space. This article fills that gap by considering how different levels of education influence democratization, and the conditions under which education is most likely to promote democracy. Analyses of eighty-five authoritarian spells from 1970 to 2008 find that higher levels of mass, primary, and tertiary education are robustly associated with democratization. Secondary analyses indicate that education is most effective in promoting democratization when both males and females are educated. An illustration from Tunisia follows.},
pages = {773--797},
number = {4},
journaltitle = {British Journal of Political Science},
shortjournal = {Brit. J. Polit. Sci.},
author = {Sanborn, Howard and Thyne, Clayton L.},
urldate = {2021-09-18},
date = {2014-10},
langid = {english},
The Test.blg:
[0] Config.pm:304> INFO - This is Biber 2.14 (beta)
[0] Config.pm:307> INFO - Logfile is 'Test.blg'
[38] biber:322> INFO - === dim. sept. 19, 2021, 14:47:26
[48] Biber.pm:415> INFO - Reading 'Test.bcf'
[128] Biber.pm:943> INFO - Found 0 citekeys in bib section 0
[131] Utils.pm:293> WARN - The file 'Test.bcf' does not contain any citations!
[136] bbl.pm:652> INFO - Writing 'Test.bbl' with encoding 'UTF-8'
[136] bbl.pm:755> INFO - Output to Test.bbl
[136] Biber.pm:128> INFO - WARNINGS: 1
The warning message when not citing in the text is LaTeX Warning: Empty bibliography on input line 16.
The error and warning messages when citing in the text are:
On line 14:
! Undefined control sequence.
! Missing $ inserted.
! Extra }, or forgotten $.
On line 15:
! Missing $ inserted.
On line 16:
LaTeX Warning: Empty bibliography on input line 16.
I’m running on Linux, Texlive2019 (apt installed that one, not a more recent one) and texmaker on Ubuntu 20.04.
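Does anyone have a solution?
Two problems:
The closing } at the end of your bib entry is missing.
\citep is not provided by biblatex by default. Use \parencite instead, or pass the natbib=true option to biblatex. Because \citep is undefined, the key is typeset as ordinary text, and the underscores in sanborn_learning_2014 trigger the "Missing $ inserted" errors.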
\documentclass[12pt,a4paper]{article}
\usepackage[utf8]{inputenc}
\usepackage[english]{babel}
\usepackage{csquotes}
\usepackage{amsmath}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage[authordate,autocite=inline,backend=biber,sorting=nyt,natbib=true]{biblatex-chicago}
\begin{filecontents*}[overwrite]{\jobname.bib}
@article{sanborn_learning_2014,
title = {Learning Democracy: Education and the Fall of Authoritarian Regimes},
volume = {44},
issn = {0007-1234, 1469-2112},
url = {https://www.cambridge.org/core/product/identifier/S0007123413000082/type/journal_article},
doi = {10.1017/S0007123413000082},
shorttitle = {Learning Democracy},
abstract = {Studies on what causes a state to democratize have focused on economic, social, and international factors. Many of them argue that higher levels of education should promote democracy. However, few articulate clearly how education affects democratization, and fewer still attempt to test the supposed link across time and space. This article fills that gap by considering how different levels of education influence democratization, and the conditions under which education is most likely to promote democracy. Analyses of eighty-five authoritarian spells from 1970 to 2008 find that higher levels of mass, primary, and tertiary education are robustly associated with democratization. Secondary analyses indicate that education is most effective in promoting democratization when both males and females are educated. An illustration from Tunisia follows.},
pages = {773--797},
number = {4},
journaltitle = {British Journal of Political Science},
shortjournal = {Brit. J. Polit. Sci.},
author = {Sanborn, Howard and Thyne, Clayton L.},
urldate = {2021-09-18},
date = {2014-10},
langid = {english},
}
\end{filecontents*}
\addbibresource{\jobname.bib}
\usepackage[left=2cm,right=2cm,top=2cm,bottom=2cm]{geometry}
\DeclareUnicodeCharacter{202F}{\,}
\begin{document}
Test citing \citep{sanborn_learning_2014}
\printbibliography
\end{document}
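The first problem (the missing closing brace) is easy to overlook in a long entry. As a rough sketch, not part of the original answer, a few lines of Python can count unclosed braces in a .bib file before Biber is run. The function name brace_balance and the shortened sample entry are made up for illustration.

```python
def brace_balance(bib_text):
    """Return the number of unclosed '{' (a negative value means extra '}')."""
    depth = 0
    escaped = False
    for ch in bib_text:
        if escaped:
            escaped = False  # the character after a backslash is taken literally
        elif ch == "\\":
            escaped = True
        elif ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
    return depth


# Shortened, hypothetical version of the entry from the question.
broken = "@article{key,\n  title = {Learning Democracy},\n  langid = {english},\n"
fixed = broken + "}\n"

print(brace_balance(broken))  # 1: the entry's opening brace is never closed
print(brace_balance(fixed))   # 0
```

A non-zero result only says that something is unbalanced somewhere in the file; it does not point at the entry responsible.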

Extracting text from APA citation

I have a spreadsheet containing APA-style citations, and I want to split each one into author(s), date, and title.
An example of a citation would be:
Parikka, J. (2010). Insect Media: An Archaeology of Animals and Technology. Minneapolis: Univ Of Minnesota Press.
With this string in cell I2, I managed to do the following:
Name: =LEFT(I2, FIND("(", I2)-1) yields Parikka, J.
Date: =MID(I2,FIND("(",I2)+1,FIND(")",I2)-FIND("(",I2)-1) yields 2010
However, I am stuck on extracting the title, Insect Media: An Archaeology of Animals and Technology.
My current formula =MID(I2,FIND(").",I2)+2,FIND(").",I2)-FIND(".",I2)) returns only part of the title. The output should contain every character between "). " and the next period.
I tried =REGEXEXTRACT(I2, "\)\.\s(.*[^\.])\.\s" ) and this generally works but does not stop at the first ". " - Like with this example:
Sanders, E. B.-N., Brandt, E., & Binder, T. (2010). A framework for organizing the tools and techniques of participatory design. In Proceedings of the 11th biennial participatory design conference (pp. 195–198). ACM. Retrieved from http://dl.acm.org/citation.cfm?id=1900476
Where is the mistake?
The title can be found (in the two examples you've given, at least) with this:
=MID(I2,find("). ",I2)+3,find(". ",I2,find("). ",I2)+3)-(find("). ",I2)+3)+1)
In English: get the substring starting after the first occurrence of "). ", up to and including the first period that follows.
If you wish to use REGEXEXTRACT, then this works (on your two examples). (You can also see a Regex101 demo.):
=REGEXEXTRACT(I3,"(?:.*\(\d{4}\)\.\s)([^.]*\.)(?: .*)")
Where is the mistake?
In your expression, the capture group (.*[^\.]) greedily matches any number of characters followed by one character that is not a dot, so it can span several sentences. The expression ends with \.\s outside the group, so the captured text stops just before a period-then-space rather than including it, and because .* is greedy that is the last such period-then-space it can reach, not the first.
Try:
=split(SUBSTITUTE(SUBSTITUTE(I2, "(",""), ")", ""),".")
If you don't replace the parentheses around 2010, it thinks it is a negative number -2010.
For the title, try wrapping your existing formula in INDEX and SPLIT:
=index(split(REGEXEXTRACT(A5, "\)\.\s(.*[^\.])\.\s" ),"."),0,1)&"."
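The same idea (author up to the year in parentheses, then everything up to the first following period) can be sketched outside the spreadsheet. This is a minimal Python illustration, assuming each citation has a four-digit year in parentheses followed by a period. The names APA and split_apa are invented for the example.

```python
import re

# authors: everything before "(YYYY)."; title: text up to and including the next period
APA = re.compile(
    r"^(?P<authors>.*?)\s*\((?P<year>\d{4})\)\.\s+(?P<title>[^.]*\.)"
)


def split_apa(citation):
    """Return (authors, year, title), or None if the pattern does not match."""
    m = APA.match(citation)
    if m is None:
        return None
    return m.group("authors"), m.group("year"), m.group("title")


example = (
    "Parikka, J. (2010). Insect Media: An Archaeology of Animals and "
    "Technology. Minneapolis: Univ Of Minnesota Press."
)
print(split_apa(example))
```

Like the formulas above, this cuts the title at the first period, so a title that itself contains a period (an abbreviation, for instance) would be truncated.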

Unable to load Bibliography using {biblatex} package and \printbibliography

This is the code I use for my bibliography
\usepackage[style=authoryear,sorting=ynt]{biblatex}
\addbibresource{bibliography.bib}
\printbibliography
This is an example of my bibliography:
@Article{Acosta2008, author = {Acosta, EG and Castilla, V and Damonte, EB}, title = {Functional entry of dengue virus into Aedes albopictus mosquito cells is dependent on clathrin-mediated endocytosis.}, journal = {J Gen Virol}, volume = {89}, number = {Pt 2}, pages = {474--484}, year = {2008}, abstract = {Entry of dengue virus 2 (DENV-2) into Aedes albopictus mosquito C6/36 cells was analysed using biochemical and molecular inhibitors, together with confocal and electron microscopy observations. Treatment with monodansylcadaverine, chlorpromazine, sucrose and ammonium chloride inhibited DENV-2 virus yield and protein expression, whereas nystatin, a blocker of caveolae-mediated endocytosis, did not have any effect. Using confocal microscopy, co-localization of DENV-2 E glycoprotein and the marker protein transferrin was observed at the periphery of the cytoplasm. To support the requirement of clathrin function for DENV-2 entry, overexpression of a dominant-negative mutant of Eps15 in C6/36 cells was shown to impair virus entry. The disruption of actin microfilaments by cytochalasin D also significantly affected DENV-2 replication. In contrast, microtubule disruption by colchicine treatment did not impair DENV-2 infectivity, suggesting that DENV-2 does not require transport from early to late endosomes for successful infection of mosquito cells. Furthermore, using transmission electron microscopy, DENV-2 particles of approximately 44-52 nm were found attached within electron-dense invaginations of the plasma membrane and in coated vesicles that resembled those of clathrin-coated pits and vesicles, respectively. Together, these results demonstrate for the first time that DENV-2 enters insect cells by receptor-mediated, clathrin-dependent endocytosis, requiring traffic through an acidic pH compartment for subsequent uncoating and completion of a productive infection.}, location = {}, keywords = {}}
I have tried several ways to load the bibliography stored in my bibliography.bib file with no results.
The error is:
Empty bibliography on input line
The bibliography is not empty, and the .bib file is in the same folder as the main .tex file.
If your bibliography with the above content is bibliography.bib, and the following
\documentclass{article}
\usepackage[style=authoryear,sorting=ynt]{biblatex}
\addbibresource{bibliography.bib}
\begin{document}
Text ... citation: \cite{Acosta2008}.
\printbibliography
\end{document}
is main.tex in the same folder, then you should have no problem.
Make sure that you compile the .tex file once, then run Biber on it, then compile the .tex file one last time.
Compare your minimal code with mine but, above all, check in your editor's settings that Biber, not BibTeX, is used to build the bibliography (example: TeXstudio).
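If the editor settings are in doubt, the three-step sequence can also be run outside the editor. Here is a small Python sketch of that sequence, assuming pdflatex and biber are on the PATH and that the file is called main.tex as above. The helper names build_steps and build are made up for this example.

```python
import subprocess


def build_steps(main_tex):
    """Return the commands for the LaTeX -> Biber -> LaTeX cycle."""
    base = main_tex[:-4] if main_tex.endswith(".tex") else main_tex
    latex = ["pdflatex", "-interaction=nonstopmode", base + ".tex"]
    return [
        latex,            # first pass writes the .bcf file with the cited keys
        ["biber", base],  # Biber reads the .bcf and writes the .bbl
        latex,            # final pass typesets citations and bibliography
    ]


def build(main_tex):
    """Run each step in order and stop at the first one that fails."""
    for cmd in build_steps(main_tex):
        subprocess.run(cmd, check=True)


# Usage (requires a TeX installation): build("main.tex")
```

Depending on the document, one more pdflatex pass may be needed afterwards to settle cross-references.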
I had the same problem, but it worked for me once I deleted the whole [...] option block from \usepackage.
So, for \usepackage[style=alphabetic]{biblatex}, I deleted the middle part so there only was \usepackage{biblatex}. I ran it and it worked. After that, I could use this one again \usepackage[style=alphabetic]{biblatex} and it worked too.

Pandoc --bibliography with Chicago Fullnote Bibliography missing space between contributors and title in citation

When using Pandoc for my citations (pandoc -o FileOut.tex FileIn.md -S --bibliography=Primary.bib --csl=chicago-fullnote-bibliography.csl) there is no space produced between the author (or contributors) and the title within a footnote. The spaces are correct in the bibliography. Any ideas?
Example:
@Roudinesco2008
produces this citation in a footnote
Elisabeth Roudinesco\emph{Philosophy
in Turbulent Times: Canguilhem, Sartre, Foucault, Althusser, Deleuze,
Derrida}, trans. William McCuaig (New York: Columbia University Press,
2008).
but correctly generates this entry in a bibliography
Roudinesco, Elisabeth. \emph{Philosophy in Turbulent Times: Canguilhem,
Sartre, Foucault, Althusser, Deleuze, Derrida}. Translated by William
McCuaig. New York: Columbia University Press, 2008.
The .bib entry is as follows:
@book{Roudinesco2008,
location = {New York},
title = {Philosophy in turbulent times: Canguilhem, Sartre, Foucault, Althusser, Deleuze, Derrida},
isbn = {9780231143004},
url = {},
shorttitle = {Philosophy in turbulent times},
pagetotal = {184},
publisher = {Columbia University Press},
author = {Roudinesco, Elisabeth},
translator = {McCuaig, William},
date = {2008},
keywords = {20th century, Education, Higher, France, History, Philosophers, Philosophy, Philosophy, French}
}
I'm pretty sure it's not a problem with my csl file, which is directly from the git repository. Maybe citeproc?
Any help is much appreciated.

lyx problem: ps, pdf "Authors not shown"

I'm trying to write a paper based on the VLDB .cls and .tex files, which can be reached from here. They use the ACM SIG Proceedings style. After fixing many errors I no longer get any, but when I export my file as PDF or PS I cannot see the author names.
The title and abstract are shown; only the output of the code between them is missing.
I'm using LyX on Ubuntu.
Here is the code for the authors:
% ****************** TITLE ****************************************
\title{A Sample {\ttlit VLDB} Proceedings Paper in LaTeX
Format\titlenote{for use with vldb.cls}}
\subtitle{[Extended Abstract]
\titlenote{A full version of this paper is available as\textit{Author's Guide to Preparing ACM SIG Proceedings Using \LaTeX$2_\epsilon$\ and BibTeX} at \texttt{www.acm.org/eaddress.htm}}}
% ****************** AUTHORS **************************************
% You need the command \numberofauthors to handle the 'placement
% and alignment' of the authors beneath the title.
%
% For aesthetic reasons, we recommend 'three authors at a time'
% i.e. three 'name/affiliation blocks' be placed beneath the title.
%
% NOTE: You are NOT restricted in how many 'rows' of
% "name/affiliations" may appear. We just ask that you restrict
% the number of 'columns' to three.
%
% Because of the available 'opening page real-estate'
% we ask you to refrain from putting more than six authors
% (two rows with three columns) beneath the article title.
% More than six makes the first-page appear very cluttered indeed.
%
% Use the \alignauthor commands to handle the names
% and affiliations for an 'aesthetic maximum' of six authors.
% Add names, affiliations, addresses for
% the seventh etc. author(s) as the argument for the
% \additionalauthors command.
% These 'additional authors' will be output/set for you
% without further effort on your part as the last section in
% the body of your article BEFORE References or any Appendices.
\numberofauthors{8} % in this sample file, there are a *total*
% of EIGHT authors. SIX appear on the 'first-page' (for formatting
% reasons) and the remaining two appear in the \additionalauthors section.
\author{
% You can go ahead and credit any number of authors here,
% e.g. one 'row of three' or two rows (consisting of one row of three
% and a second row of one, two or three).
%
% The command \alignauthor (no curly braces needed) should
% precede each author name, affiliation/snail-mail address and
% e-mail address. Additionally, tag each line of
% affiliation/address with \affaddr, and tag the
% e-mail address with \email.
%
% 1st. author
\alignauthor
Ben Trovato\titlenote{Dr.~Trovato insisted his name be first.}\\
\affaddr{Institute for Clarity in Documentation}\\
\affaddr{1932 Wallamaloo Lane}\\
\affaddr{Wallamaloo, New Zealand}\\
\email{trovato@corporation.com}
% 2nd. author
\alignauthor
G.K.M. Tobin\titlenote{The secretary disavows
any knowledge of this author's actions.}\\
\affaddr{Institute for Clarity in Documentation}\\
\affaddr{P.O. Box 1212}\\
\affaddr{Dublin, Ohio 43017-6221}\\
\email{webmaster@marysville-ohio.com}
% 3rd. author
\alignauthor Lars Th{\Large{\sf{\o}}}rv{$\ddot{\mbox{a}}$}ld\titlenote{This author is the
one who did all the really hard work.}\\
\affaddr{The Th{\large{\sf{\o}}}rv{$\ddot{\mbox{a}}$}ld Group}\\
\affaddr{1 Th{\large{\sf{\o}}}rv{$\ddot{\mbox{a}}$}ld Circle}\\
\affaddr{Hekla, Iceland}\\
\email{larst@affiliation.org}
\and % use '\and' if you need 'another row' of author names
% 4th. author
\alignauthor Lawrence P. Leipuner\\
\affaddr{Brookhaven Laboratories}\\
\affaddr{Brookhaven National Lab}\\
\affaddr{P.O. Box 5000}\\
\email{lleipuner@researchlabs.org}
% 5th. author
\alignauthor Sean Fogarty\\
\affaddr{NASA Ames Research Center}\\
\affaddr{Moffett Field}\\
\affaddr{California 94035}\\
\email{fogartys@amesres.org}
% 6th. author
\alignauthor Charles Palmer\\
\affaddr{Palmer Research Laboratories}\\
\affaddr{8600 Datapoint Drive}\\
\affaddr{San Antonio, Texas 78229}\\
\email{cpalmer@prl.com}
}
% There's nothing stopping you putting the seventh, eighth, etc.
% author on the opening page (as the 'third row') but we ask,
% for aesthetic reasons that you place these 'additional authors'
% in the \additional authors block, viz.
\additionalauthors{Additional authors: John Smith (The Th{\o}rv\"{a}ld Group,
email: {\texttt{jsmith@affiliation.org}}) and Julius P.~Kumquat
(The Kumquat Consortium, email: {\texttt{jpkumquat@consortium.net}}).}
\date{30 July 1999}
% Just remember to make sure that the TOTAL number of authors
% is the number that will appear on the first page PLUS the
% number that will appear in the \additionalauthors section.
\maketitle
I also tried another author format, and the author names are still not shown.
% ****************** TITLE ****************************************
\title{Alternate {\ttlit ACM} SIG Proceedings Paper in LaTeX
Format\titlenote{(Produces...}}
\numberofauthors{3}
% Three authors sharing the same affiliation.
\author{
\alignauthor Ben King\\
\email{king@cs.berkeley.edu}
%
\alignauthor Georgia Tobin\\
\email{tobin@cs.berkeley.edu}
%
\alignauthor Gerald Murray\\
\email{murrray@cs.berkeley.edu}
%
\sharedaffiliation
\affaddr{Department of Electrical Engineering and Computer Science } \\
\affaddr{University of California, Berkeley } \\
\affaddr{Berkeley, CA 94720-1776 }
}
%
\maketitle
Also, when I checked the title, I saw this:
Error in latexParagraphs: You should not mix title layouts with normal ones.
However, I can still view the document as DVI, PS or PDF.
I'm not sure what causes the problem.
EDIT
When converting from .lyx to PDF or DVI I still don't see the authors; however, when I use shell commands everything is fine.
I think this is a problem with LyX.
I spent 8 hours trying to solve it. I even set up LyX on Windows 7 and tried there, with the same result.
Thanks, LyX :) You are good when you are working properly.
The vldb.cls class from the link you mention does not define a command called \sharedaffiliation. Commenting out that command allows the file to compile, and the author names are shown:
\documentclass{vldb}
\title{Alternate {\ttlit ACM} SIG Proceedings Paper in LaTeX
Format\titlenote{(Produces...}}
\numberofauthors{3}
\author{
\alignauthor Ben King\\
\email{king@cs.berkeley.edu}
\alignauthor Georgia Tobin\\
\email{tobin@cs.berkeley.edu}
\alignauthor Gerald Murray\\
\email{murrray@cs.berkeley.edu}
% \sharedaffiliation
\affaddr{Department of Electrical Engineering and Computer Science } \\
\affaddr{University of California, Berkeley } \\
\affaddr{Berkeley, CA 94720-1776 }
}
\begin{document}
\maketitle
\end{document}
The address is ugly this way, though. It is up to the journal to define how to display author addresses with a single affiliation. Unfortunately, the sample file does not do this. In that situation I would contact the editor and ask.

Resources