Product Hunt logo dark
  • Launches
    Coming soon
    Upcoming launches to watch
    Launch archive
    Most-loved launches by the community
    Launch Guide
    Checklists and pro tips for launching
  • Products
  • News
    Newsletter
    The best of Product Hunt, every day
    Stories
    Tech news, interviews, and tips from makers
    Changelog
    New Product Hunt features and releases
  • Forums
    Forums
    Ask questions, find support, and connect
    Streaks
    The most active community members
    Events
    Meet others online and in-person
  • Advertise
Subscribe
Sign in
Subscribe
Sign in
Golden Hill Software

Golden Hill Software

Developer of Unread and Feed Hawk

2 followers

Developer of Unread and Feed Hawk

2 followers

Visit website
Developer of Marcato and CloudPull. For support, please email: support@goldenhillsoftware.com
  • Overview
  • Launches2
  • Reviews
  • Alternatives
  • Team
  • More
Company Info
goldenhillsoftware.com
Golden Hill Software Info
Launched in 2016View 2 launches
Forum
p/golden-hill-software
  • Blog
  • •
  • Newsletter
  • •
  • Questions
  • •
  • Forums
  • •
  • Product Categories
  • •
  • Apps
  • •
  • About
  • •
  • FAQ
  • •
  • Terms
  • •
  • Privacy and Cookies
  • •
  • X.com
  • •
  • Facebook
  • •
  • Instagram
  • •
  • LinkedIn
  • •
  • YouTube
  • •
  • Advertise
© 2025 Product Hunt
SocialX
This is the 2nd launch from Golden Hill Software. View more

Webpage Text API

Get the HTML content of a webpage without the junk.
The Webpage Text API is a cloud service that lets you easily retrieve the HTML for the content of a webpage without the junk (chrome, navigation, ads, and scripts) that tends to clutter modern webpages.
Webpage Text API gallery image
Webpage Text API gallery image
Webpage Text API gallery image
Payment Required
Launch tags:
Web App•API
Launch Team
John Brayton

What do you think? …

John Brayton
John Brayton
Golden Hill Software

Golden Hill Software

Maker
The Webpage Text API has been powering the webpage text feature of Unread, my RSS reader, since February 2020. It is perfect for RSS readers, read later services, browser extensions, newsbots, and other applications where the user wants the content of the webpage without the cruft. I started developing the Webpage Text API for Unread in 2018, before Mercury Parser went open source. At the time Unread had webpage text retrieval capabilities powered by Readability.js. That worked well, but I needed the ability to cache webpage text and associated images ahead of time. It was impractical to generate webpage text for thousands of articles at a time on-device, so I researched server-based options. At that time Mercury Reader provided an API and generously made it available for free. However their terms of service would not allow Unread to aggressively cache webpage text for articles ahead of time. The Mercury Parser source code had not yet been made public. I looked into commercial options, but none fit my needs. So I started writing my own server-based system. I started by incorporating the heuristics used by Readability.js. I then added test cases from hundreds of different websites to improve the webpage text quality. After Mercury Parser went open source, I evaluated whether it would be more suitable for generating webpage text for Unread. I discovered that I got higher quality results from my own Webpage Text API than I would from Mercury Parser. This inspired me to continue improving the Webpage Text API, and to now offer it as a commercial product.
Report
4yr ago
Aravs
Aravs
Knibble.AI

Knibble.AI

This is great. Thanks for building this. I'd love to see free tier to try it out and a pay as you go pricing model. The pricing seems to be high.
Report
4yr ago
John Brayton
John Brayton
Golden Hill Software

Golden Hill Software

Maker
@aravs7 Thank you. I am happy to give trial codes for folks to start developing against it, and to hold off charging until a customer starts using it in a production environment.
Report
4yr ago
Aravs
Aravs
Knibble.AI

Knibble.AI

@john_brayton That's great, How do I get the trial code?
Report
4yr ago
John Brayton
John Brayton
Golden Hill Software

Golden Hill Software

Maker
@aravs7 Contact me at sales@goldenhillsoftware.com. I will need to know which price plan you want to try and the product/service name you are working on. (This can be changed later.)
Report
4yr ago
John Alex
John Alex
how can i embed this feature in my website. my website Address: https://speakingbusiness.co.uk
Report
3yr ago
Real-time insights by Redis
Real-time insights by Redis — Debug and monitor for free.
Debug and monitor for free.
Promoted

Golden Hill Software Launches

Webpage Text API
Webpage Text API Get the HTML content of a webpage without the junk.

Launched on August 25th, 2021

Do you use Golden Hill Software?

Pros
Cons
Reviews
Helpful

You might also like

Nanonets
AI-Powered Document Processing and Workflow Automation
Base64.ai
Base64.ai
Extract text, data, photos and more from all types of docs
Teutonic CSS
Here’s 12KB of CSS to jump start your HTML
Quote Block
Quote Block
Extract text from books/screenshots, make notes, learn ..
EasyOCR
EasyOCR
NoCode AI as a service, automate text extraction from images
SelectorsHub
SelectorsHub
The Next Gen XPath & cssSelector IDE
View more
Review Golden Hill Software?Be the first to review Golden Hill Software