Every now and then (<5% of the time) Travis builds stall because a wget call to a PHP page times out. Example: https://travis-ci.org/marcelstoer/nodemcu-custom-build/builds/69239694 I used curl for several weeks at first, but then switched to wget because of this issue.
cURL
curl -m 60 url
wget
wget -qO- url &> /dev/null
The server that serves the PHP page doesn't report any errors, and the Apache access log shows the request to the PHP page returning HTTP 200.
What could I do to analyze this further?
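One way to gather more data the next time a build stalls (a sketch; the URL is a placeholder and the timeout/retry values are arbitrary) is to make the timeout and retry behaviour explicit and keep the debug output of either tool:
# wget: -d writes debug output to stderr, captured in wget-debug.log
wget -d -O- --timeout=60 --tries=3 "https://example.com/build.php" > /dev/null 2> wget-debug.log
# curl: --retry repeats the request on transient errors such as timeouts
curl -v --max-time 60 --retry 3 "https://example.com/build.php" > /dev/null 2> curl-trace.log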
We make use of GitHub self-hosted Actions runners running on EC2 instances (m5.xlarge). We use these as part of our CI/CD pipeline to support Docker image builds and automated testing. This solution has worked fine for the last year or so, but all of a sudden yesterday the builds started to fail with the following error message:
time="2023-02-03T12:00:13Z" level=error msg="error waiting for container: unexpected EOF"
My understanding is that this is typically caused by Docker containers hitting resource limits (CPU / memory), but given that these are m5.xlarge instances (4 vCPU and 16 GB memory) I'm a little surprised. Our builds make use of NPM, which I understand can be quite resource hungry, but monitoring a container during its execution showed that it was nowhere near the limits of the node.
I've tried cycling the nodes but there is no difference in behaviour. The following user-data script is used with these nodes; it connects them to our GitHub account and makes them available for jobs. I've also tried using the latest actions-runner package, but again, no change in behaviour. What other reasons could this error be thrown for? I'm a bit stumped by this.
#!/bin/sh
set -e
curl https://get.docker.com | bash
apt install -y python3-pip jq
pip3 install awscli
mkdir actions-runner && cd actions-runner
curl -O -L https://github.com/actions/runner/releases/download/v2.286.0/actions-runner-linux-x64-2.286.0.tar.gz
tar xzf ./actions-runner-linux-x64-2.286.0.tar.gz
chown -R ubuntu:ubuntu .
instance_id="$(curl -s http://169.254.169.254/latest/meta-data/instance-id)"
url="https://api.github.com/orgs/<REMOVED>/actions/runners/registration-token"
token=$(curl -s -u "<REMOVED>:<REMOVED>" -X POST "$url" | jq -r .token)
sudo -u ubuntu ./config.sh \
  --name "products-stage-ec2-runner-$instance_id" \
  --token "$token" \
  --url "https://github.com/<REMOVED>" \
  --labels "<REMOVED>" \
  --unattended
sudo ./svc.sh install
sudo ./svc.sh start
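A few host-level checks that can rule out OOM kills and Docker daemon restarts as the cause of the "unexpected EOF" above (a sketch using standard Linux, systemd and Docker tooling; nothing here is specific to this runner setup):
# kernel OOM-killer activity around the failure time
dmesg -T | grep -iE 'oom|killed process'
# dockerd restarts or errors in the last hour
journalctl -u docker --since "1 hour ago" --no-pager | tail -n 50
# live resource usage of the containers started by a build
docker stats --no-stream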
See my comment for details of how I resolved this.
I have this wget command:
wget -P /home/storyplayr/storyplayr-web-playr -O sitemap.xml https://preprod.storyplayr.com/sitemap.xml?snapshots=true&onlyBlog=true --no-check-certificate
When executed from the command line, it works fine
When executed from a shell script (.sh), I get a 504 Gateway Time-out.
I have tried setting the timeout on the wget command but it does nothing. It is clearly related to the sh command...
Any idea why? Any idea how to avoid this?
Thanks!!!
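One thing worth double-checking in the script (a guess, not the confirmed resolution): the URL is unquoted, so in sh the & before onlyBlog=true puts wget into the background and truncates the query string. Quoting the URL rules that out:
wget -P /home/storyplayr/storyplayr-web-playr -O sitemap.xml --no-check-certificate "https://preprod.storyplayr.com/sitemap.xml?snapshots=true&onlyBlog=true"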
I am trying to find a way to make these two approaches work together. I can run this successfully using Wget for Windows:
wget --html-extension -r http://www.sitename.com
this downloads every single file on my server that is directly linked from the root domain. I'd rather download only the pages in my sitemap. For this, I found the following trick, which uses Cygwin:
wget --quiet https://www.sitename.com/sitemap.xml --output-document - | egrep -o \
"http://www\.sitename\.com[^<]+" | wget --spider -i - --wait 1
However, this only checks that the pages exist; it does not download them as static HTML files the way the prior wget command does.
Is there a way to merge these and download the sitemap pages as local html files?
If you look at the man page for wget, you will see that the --spider entry is as follows:
--spider
When invoked with this option, Wget will behave as a Web spider, which means that it will not download the pages, just check that they are there.
All you need to do to actually download the file is remove the --spider from your command.
wget --quiet https://www.sitename.com/sitemap.xml --output-document - | egrep -o \
"https?://www\.sitename\.com[^<]+" | wget -i - --wait 1
I'm running the wget command
wget -O - -q -t 1 --timeout=0 'http://servername.com/url'
which invokes a nodejs script running in the background. Once the nodejs script completes, it shows details in the output. If it runs for 2-3 minutes everything is OK, but if it runs longer than 5 minutes the wget command just quits/terminates without output while the script keeps running in the background; this can only be seen in the application server logs. Basically, the required output is lost.
Why does wget close even though the timeout is set to 0? It should wait for hours. (On Debian it works, by the way, but on Red Hat I see the above behaviour.) Any solution that can be suggested? Thanks.
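A hard cut-off at roughly five minutes is often an idle or read timeout in something between wget and the Node.js application (a reverse proxy, load balancer, etc.) rather than in wget itself. To narrow it down (a sketch; the URL is the placeholder from the question), run wget with debug output and the individual timeouts spelled out, and compare with curl, which applies no transfer timeout by default:
# wget debug output goes to stderr, captured in wget-debug.log
wget -O - -t 1 -d --dns-timeout=0 --connect-timeout=0 --read-timeout=0 'http://servername.com/url' 2> wget-debug.log
# curl without --max-time waits indefinitely for the response body
curl -sS -N 'http://servername.com/url'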
We are trying to run this command:
wget -O xyz.xls --user=COldPolar --password=GlacierICe --ignore-length=on "http://Colder.near.com:8080/sr/jira.issueviews:searchrequest-excel-current-fields/temp/SearchRequest.xls?&runQuery=true(jqlQuery=project%3DCCD)&tempMax=1000"
This returns a 3 KB output.
If we open IE and use the following URL: "http://Colder.near.com:8080/sr/jira.issueviews:searchrequest-excel-current-fields/temp/SearchRequest.xls?&runQuery=true(jqlQuery=project%3DCCD)&tempMax=1000" we can save a 1.7 MB file. Please advise how to get wget to work.
If you can use cURL you can do:
curl -o xyz.xls -u COldPolar:GlacierICe 'http://Colder.near.com:8080/sr/jira.issueviews:searchrequest-excel-current-fields/temp/SearchRequest.xls?&runQuery=true(jqlQuery=project%3DCCD)&tempMax=1000'
The way I managed to get wget to work was to do this first:
wget --save-cookies cookies.txt --post-data 'os_username=COldPolar&os_password=GlacierICe&os_cookie=true' http://Colder.near.com:8080/login.jsp
And then:
wget -O xyz.xls --load-cookies cookies.txt "http://Colder.near.com:8080/sr/jira.issueviews:searchrequest-excel-current-fields/temp/SearchRequest.xls?&runQuery=true(jqlQuery=project%3DCCD)&tempMax=1000"
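To confirm that the cookie-based request actually returned the spreadsheet rather than an HTML login or error page, a quick check (a sketch using standard tools):
file xyz.xls
head -c 300 xyz.xls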
One of the things to watch out for is a self-signed certificate. You can rule this out by running with --no-check-certificate.