From debbugs-submit-bounces@debbugs.gnu.org Wed Mar 01 16:35:52 2023
Received: (at 60976) by debbugs.gnu.org; 1 Mar 2023 21:35:52 +0000
From: Frank Pursel
To: 60976@debbugs.gnu.org, julien@lepiller.eu
Subject: [PATCH v4 1/3] gnu: Add java-jericho-html
Date: Thu, 26 Jan 2023 08:46:25 -0800
Message-Id: <48adf8ab622e5051d0012bfb45703c8a73cad92c.1677706130.git.frank.pursel@gmail.com>

---

Posting a full update. I believe I've addressed all the identified
issues concerning java-jericho-html...

 gnu/packages/java-xml.scm | 63 +++++++++++++++++++++++++++++++++++----
 1 file changed, 58 insertions(+), 5 deletions(-)

diff --git a/gnu/packages/java-xml.scm b/gnu/packages/java-xml.scm
index 5b16806e27..faa51a66fd 100644
--- a/gnu/packages/java-xml.scm
+++ b/gnu/packages/java-xml.scm
@@ -6,6 +6,7 @@
 ;;; Copyright © 2020 Vincent Legoll
 ;;; Copyright © 2021 Léo Le Bouter
 ;;; Copyright © 2022 Christopher Baines
+;;; Copyright © 2023 Frank Pursel
 ;;;
 ;;; This file is part of GNU Guix.
 ;;;
@@ -23,20 +24,72 @@
 ;;; along with GNU Guix. If not, see .
 
 (define-module (gnu packages java-xml)
-  #:use-module (gnu packages)
+  #:use-module ((guix licenses) #:prefix license:)
   #:use-module (gnu packages base)
   #:use-module (gnu packages check)
   #:use-module (gnu packages compression)
   #:use-module (gnu packages gtk)
   #:use-module (gnu packages java)
-  #:use-module ((guix licenses) #:prefix license:)
-  #:use-module (guix gexp)
-  #:use-module (guix packages)
+  #:use-module (gnu packages)
+  #:use-module (guix build-system ant)
+  #:use-module (guix bzr-download)
   #:use-module (guix download)
+  #:use-module (guix gexp)
   #:use-module (guix git-download)
-  #:use-module (guix build-system ant)
+  #:use-module (guix packages)
   #:use-module (guix utils))
 
+(define-public java-jericho-html
+  (package
+    (name "java-jericho-html")
+    (version "3.4")
+    (source (origin
+              (method bzr-fetch)
+              (uri
+               (bzr-reference
+                (url
+                 "http://jerichohtml.bzr.sourceforge.net/bzr/jerichohtml")
+                (revision (string-append "tag:" version))))
+              (file-name (string-append name "-" version "-checkout"))
+              (sha256
+               (base32
+                "1zsf4i33jn05pma4y1658d6avhw7x4c12ggs96szhc06b7bxs8j0"))
+              (modules '((guix build utils)))
+              (snippet '(begin
+                          (format #t "~%~a~%" "Removing sourced jar files.")
+                          (for-each (lambda (jarf)
+                                      (delete-file jarf)
+                                      (format #t "Deleted: ~a~%" jarf))
+                                    (find-files "." "\\.jar$"))))))
+    (build-system ant-build-system)
+    (arguments
+     `(#:jar-name (string-append ,name ".jar")
+       #:phases (modify-phases %standard-phases
+                  (add-before 'build 'add-ant-env-options
+                    (lambda* _
+                      (setenv "ANT_OPTS" "-Dfile.encoding=iso-8859-1")
+                      (let ((match-str
+                             (string-append "jerichohtml-" ,version)))
+                        (substitute* "build.xml"
+                          ((match-str)
+                           "")))))
+                  (add-after 'build 'check-prep
+                    (lambda* (#:key source #:allow-other-keys)
+                      (mkdir-p "src/test/java"))))))
+    (native-inputs (list
+                    java-commons-logging-minimal
+                    java-junit
+                    java-log4j-api
+                    java-slf4j-api))
+    (home-page "http://jericho.htmlparser.net/docs/index.html")
+    (synopsis "Java HTML Parser library")
+    (description
+     "This Java library allowing analysis and manipulation of
+parts of an HTML document, including server-side tags, while
+reproducing verbatim any unrecognised or invalid HTML. It also
+provides high-level HTML form manipulation functions.")
+    (license (list license:lgpl2.1+ license:asl2.0 license:epl1.0))))
+
 (define-public java-simple-xml
   (package
     (name "java-simple-xml")

base-commit: 0e480ca7b60428a62fc4681d7aca6c7c067add42
-- 
2.39.1