Unlock Articles with Paywallskip (paywallskip.com)
55 points by francocanzani on Sept 2, 2024 | 62 comments


Hi! Our project addresses the limitations of existing paywall bypass tools by implementing a dynamic, community-driven approach. Key features include:

Real-time Adaptive Blacklist:

- Constantly updated database of paywalled sites and effective bypass methods
- User-driven reporting system for quick adaptation to paywall changes
- Significantly faster response to new paywalls compared to static solutions

Multi-Method Bypass Arsenal:

- Unlike single-method solutions (e.g., 12ft.io's cache access), we employ various techniques
- Methods include: User-Agent spoofing, Referer header manipulation, JS disabling post-load, and web archive fallbacks (Wayback Machine, archive.is, Google cache)
- Our blacklist determines the most effective method per site, improving success rates

Site-Specific Solutions:

- Tracking individual websites allows for custom bypass methods when general approaches fail
- Parsed and validated HTML output ensures content integrity
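A per-site method table could look roughly like the sketch below. The method names, the default order, and the example domains are all hypothetical placeholders; the service's real identifiers and data are not described in this post.

```python
from urllib.parse import urlsplit

# Hypothetical method names; the order is an assumption for illustration.
DEFAULT_ORDER = ["user_agent", "referer", "disable_js", "archive"]

# Hypothetical per-site overrides standing in for the community-maintained list.
SITE_OVERRIDES = {
    "example-news.com": ["archive"],
    "example-daily.org": ["referer", "archive"],
}

def methods_for(url: str) -> list[str]:
    """Return the ordered list of method names to try for this URL."""
    host = (urlsplit(url).hostname or "").lower()
    # Treat "www.example.com" and "example.com" as the same site.
    if host.startswith("www."):
        host = host[4:]
    # Copy the list so callers cannot mutate the shared tables.
    return list(SITE_OVERRIDES.get(host, DEFAULT_ORDER))
```

A site with an override gets only its listed methods; every other site falls through to the default order.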

We believe this approach offers a more robust and adaptable solution to paywall bypassing. We're eager to hear the community's thoughts and potential improvements.


Doesn’t seem to work on wsj, which is the only site archive.is doesn’t work on consistently anymore. So why not just use archive.is?


Will have a look. It worked earlier today.


wsj has easy-to-bypass paywall but hard-to-bypass antibot/antiscraping solution, so the browser extensions work better than hosted bypassers


That bypass paywalls chrome extension has never worked for me on wsj, even on a fresh chromium install. What extension is supposed to work?


Is this open or shared source?

Need help?


I'm really doubtful about this because most of these open source apps get banned from GitHub. I'll probably make a dummy account to open issues and always keep a copy locally and on GitLab. I should be able to clean up the code a bit and share it soon.



Will have a look. Thanks


Even if you can’t host it on a web-based platform, a git repo accessible over SSH would be sufficient. I’m happy to submit patches via email :)


What will you do when the lawyers come for you?


The legal landscape surrounding this issue remains ambiguous. I've documented my analysis in the legal section of my website. Typically, the consequence is domain takedowns, which is why I proactively purchased 10 domains as a precautionary measure.

https://www.paywallskip.com/posts/legal


Get a .is domain potentially. They kinda don’t really care about these takedown requests IIRC.


:D


I’d like something like this, but instead of bypassing the paywalls and stealing the articles it’d pay a few cents for each article and charge me at the end of the month.

I really, really want to pay but I really, really don’t want to set up a subscription to read a one-off article.


You are not stealing anything, unless you find a way to delete the source article after you downloaded it. It might break copyright rules. I wouldn't recommend doing either though.


Hmm, I think the victims of organized corporate espionage might see this completely differently. It's totally enough if "agents" of a certain country known for not really caring about copyright copy the plans for your new product. They don't have to delete your original plans to do damage.


The damage is different. In actual, historical stealing, use of the item is a zero-sum game, so if you take it you deprive the person deemed the rightful owner of its use (no comment or judgment on how that deeming happened, for the purpose of this comment). This is breach of contract. It is not a zero-sum game. There is no deprivation of use for the person deemed the owner. Instead there is an ambiguous effect on revenue because you followed the natural market outcome for information and distributed it freely. Maybe you decreased revenue potential because the person would have otherwise paid; maybe the person would have otherwise gone without the information, so there is no revenue effect but a deadweight loss to society from that person acting in a less informed way; maybe you increased revenue because the one freebie caused a purchase that otherwise wouldn't have happened. It is situation dependent, but in any situation it is not correct to call this stealing, despite the marketing efforts of copyright owners to get it mislabelled as such.


Blendle used to provide it, but alas, no more:

https://en.m.wikipedia.org/wiki/Blendle


I'm sure there was a startup a few years ago trying to achieve this. I can't recall the name sadly, it was 10+ years ago. Agar or something perhaps?

I believe you topped up a balance and, if the site supported it, could access per-article resources for a one-off fee. For this to really work it needs to be site independent, however.


Nobody's ever been prepared to try microtransactions that are actually micro.

I'd like a browser addon that warns of paywalled links before I click them, perhaps adds a padlock icon or dollar sign (maybe with required subscription/cost on hover), to avoid nonsense like 'the first paragraph or two is free, then you hit the paywall'.

Even better if it can do the same for region-locked sites and adblocker-blocking sites, or if on mobile, sites that nag/force you to use apps instead of the web...


I thought the Brave browser would provide such a solution. I was very disappointed to find out it's not an intended use case.


Not a fan of paywalls, but the framing of the pricing comparison is just so funny to me.

Like the guy who comes into the pub offering to sell dubiously acquired meat for a bargain price.


“Is using PaywallSkip legal?” is in the FAQs, but not “Is using PaywallSkip ethical?”

Why do we feel so entitled to other people’s work as soon as it’s published on the internet?

When Homer Simpson reads magazines at the convenience store without paying, we understand that he’s behaving poorly. Why do we feel differently now?


I tried to subscribe to an online newspaper in my country.

Subscription was very easy. When I wanted to end it, they made it purposefully hard for me to do. I think it's unacceptable, and should be outlawed: the process to unsubscribe should be as hard (or easy) as the one to subscribe.

Since newspapers (at least that specific one) have no problem doing this as long as it's legal, why should the public be more observant of ethics?


If Revolut is available in your country, create a new virtual card for each subscription and when you can't/don't want to pay them anymore just cancel the card in Revolut.


Also works with Wise (formerly TransferWise).


Because the corner store has, in many cases, become a gigantic multinational monster that exploits the authors and the readers.


> Why do we feel so entitled to other people’s work as soon as it’s published on the internet?

Because intellectual property is a broken and harmful system.


It's the old leftist anarchism attitude: stealing from someone bigger than you is no crime. Why? Because the system is evil anyway, so no need to play nice. It starts with people grabbing a newspaper from the stand and not throwing in the coins. And from there, it's uphill... Some people think it's totally fine to install a specific app to bypass YouTube ads instead of buying Premium. 30 years ago, these people were cracking pay TV with their computers. And it was cool!

P.S.: I am only reporting, this is not my attitude.


> stealing from someone bigger [than] you is no crime.

Although it's finely said, in a related worldview, I would adapt it to say:

> stealing from a publicly traded corporation is not a crime because they're not a person anyway

This takes away the pain inflicted on someone when stealing from that individual. Traded corporations carry no pain.


>some people think it’s okay to install a specific app to bypass youtube ads

My god man with the way those things have been trending it makes you a saint lol


> When Homer Simpson reads magazines at the convenience store without paying

A more specific example would be ripping off the plastic packaging to read a magazine. The ones without extra packaging are designed, and have long been expected, to be browsed.


Are you considering a chrome extension to automate the process from the client perspective?


Yes, it is being developed and will launch this week. Just click and go, nothing fancy.


not every hero wears a cape


"but all villains do"

(This quip means that if a demagogue aggrandizes themself as a hero or savior, you should be suspicious of whether they really are as saintly as they claim to be.)

Edit:

I'm not saying this quip about paywallskip, which I think is a kindhearted service because it helps reduce the information gap, or digital divide, between the middle class and those stuck in poverty.

Rather, the joke/warning is about demagogues in politics or in wealthy churches, stuff like that.

Not about this post.

Paywallskip never called themselves a hero - an appreciative user did, which is authentic and sweet.

I think no one should ever call themselves a hero.

Just help people, like Paywallskip is doing.

I guess that what real heroes always say, both in movies and in real life, is "I was just doing my job". See e.g. this school bus driver who saved a child's life:

https://www.youtube.com/shorts/xnW3NHbMTRs

(short, inspiring video)


Seems to be broken. Get a lot of server side error messages


Do you also support skipping the paywall for services like AWS as well? That would be useful.


That would take the fun out of AWS


Does paywallskip scrape archive.is and archive.org?


It falls back to archives, yes. Basically, I use different User-Agent headers and different Referer headers, it tries disabling JavaScript once the page has loaded, and the fallback is to fetch from web archives (Wayback Machine, archive.is, Google cache).

Then the HTML is validated and parsed.
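The try-in-order-then-validate flow can be sketched generically as below. This is an illustration, not the service's code: the fetchers are left as caller-supplied functions, and the length check is a hypothetical stand-in for the real HTML validation, which is not described here.

```python
from typing import Callable, Iterable, Optional

# A fetcher takes a URL and returns HTML, or None if it got nothing usable.
Fetcher = Callable[[str], Optional[str]]

def looks_complete(html: Optional[str], min_chars: int = 500) -> bool:
    """Hypothetical validation step: accept only reasonably long responses."""
    return bool(html) and len(html) >= min_chars

def first_working(
    url: str, fetchers: Iterable[tuple[str, Fetcher]]
) -> Optional[tuple[str, str]]:
    """Try each (name, fetcher) in order; return the first result that validates."""
    for name, fetch in fetchers:
        try:
            html = fetch(url)
        except Exception:
            # A failing method should not stop the chain; move to the next one.
            continue
        if looks_complete(html):
            return name, html
    return None
```

Putting the archive fetcher last in the list gives the fallback behaviour described above.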


This is going to be a shameless plug here.

I'm also frustrated by how paywalls work, having to endlessly subscribe to yet another newspaper or newsletter, just because you want to read one article.

But this approach is... stealing. When it comes to big media like the NYT, it might not feel like it is, but at the end of the day, it's people doing the background research, writing the content, editing, etc.

This is _not_ the solution.

Our alternative in the making offers a legal, fair alternative: buy and sell content by the piece.

It's available at https://vernus.one

If you have a blog or newsletter, or if you're a journalist, I'm happy to help in setting up your publication and start earning in a transparent manner.


Disrupting the space and then resolving the mess is a tried and true tactic in this industry.

Just because this site isn’t directly paying the content providers now doesn’t mean they won’t in the future. Providing it free, even with ambiguous legality, can allow them to build enough of a following to give them leverage in future negotiations.


It's not "not directly paying", it's not paying at all.

I don't see how this is different from disrupting the banking sector by conducting a heist.


> I don't see how this is different from disrupting the banking sector by conducting a heist.

Let's leave aside this logical fallacy; we're all adults here.

Buying music on iTunes became popular because it was easier than pirating the music. You could buy individual songs for less than 99 cents (you still can do that [1])

News outlets have the option of selling content by the piece (as you suggest) instead of forcing you to go into a monthly or annual subscription you don't need because you just want to read 1 or 2 articles per month from a particular newspaper.

However, they don't want to do so. And because of that, pirating the content becomes again more convenient; like in the pre-iTunes years.

Your idea of using an intermediary service to get that content isn't the solution. I'm not interested in a third party profiling me based on the content I read online.

Edited to add reference: [1] https://support.apple.com/en-us/109338


There's no profiling going on, we take a cut of the sale price and that is it. But I understand your concern and will think hard about how we could make this more obvious. I'm not sure if it's a general concern or it applies more to people like us.

Also, I assume you read stuff on Substack, Medium, Reddit, etc. Definitely so on a third party commonly known as HN.

Re iTunes: it's not about the will of the news outlets. It is impractical for them to offer a model which sells by the piece. And the most prominent cause is banal: payment fees are too high a percentage on what would be a typical price for a single piece of content.

Keeping a credit balance solves that, but keeping a credit balance at every outlet is just as bad as monthly subs.

An aggregator can solve this issue.
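To put rough numbers on the fee argument, here is a small calculation assuming a hypothetical card fee of 0.30 fixed plus 2.9% per charge. Both values are illustrative assumptions, not any processor's quoted rate, and the 0.25 article price is made up too.

```python
# Assumed fee structure, for illustration only.
FIXED_FEE = 0.30      # flat amount charged per transaction
PERCENT_FEE = 0.029   # share of the transaction amount

def fee_share(amount: float) -> float:
    """Fraction of a single charge that goes to payment fees."""
    fee = FIXED_FEE + PERCENT_FEE * amount
    return fee / amount

# One charge per article versus one top-up covering many articles.
per_article = fee_share(0.25)   # paying for a single 0.25 article
per_top_up = fee_share(10.00)   # topping up a 10.00 credit balance once
print(f"single article: {per_article:.1%}, top-up: {per_top_up:.1%}")
```

Under these assumptions the fee on a single cheap article exceeds the article's price, while the same fee spread over a larger top-up is a small fraction of it.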


I agree with your take, but pitching your own solution as part of it significantly lowers the appeal of your comment, at least for me.


I couldn't really care less.

If you look at the middle part of the domain between news. and .com, it might remind you that this is a community of people building stuff, learning to sell it, improving upon the feedback.

I don't really see how pitching a legal solution in a thread below the pitch of an obviously illegal, or at least unethical one is problematic.


Fair enough. It just felt salesy, that’s all I‘m saying.


Why? This is HN no? The entire point of this community is to provide technology solutions (mostly through companies). If any place should welcome this, it should be HN.


You are right of course. I think it was something in the wording that turned me off. It was an instinctive reaction that I now regret.


Too salesy


My comment or the site? If the former, I acknowledge that, I got a bit triggered by YACTS (yet another content theft service). If the latter, I'm genuinely curious and happy to make changes to our copy.

Paywallskip seems too stealsy to me FWIW.


Can I get my money back if it turns out that some NYT article was outright lies, probably already paid for by some undisclosed shady entity?


Do you get your money back if you go to the store and buy some new food/fruit/snack you don't like the taste of?

No, you throw it away, and probably won't buy it again. If you don't like NYT, don't buy from them.

If NYT is like an avocado for you, sometimes ripe and delicious, sometimes unripe, sometimes rotten, you get to decide how often you're gonna buy avocados, or if you'll develop your own methods of avocado testing before buying to increase your odds. In no case do you get to take the avocado skin back to the store asking for a refund.


I think the original question has some logic.

If the content is untruthful, it isn't in analogy with a fruit that you don't like, it's in analogy with a rotten fruit.

I expect a refund if a rotten fruit is delivered.

Really hard to implement with content.


Perhaps a simpler analogy, you see a new bag/flavour of chips in the store, "super crunchy" "delicious", you buy it, go home, tastes horrible, barely crunchy, do you get to take it back and get your money back?


Yep! Local stores are 'happy' to refund since they tend to value customers. Megacorps... not so much.


Misleading advertising is so common, I wouldn't really fret over it.

If it says super crunchy and you assess the crunchiness is merely average, well, that's your perception.

But if it says it's paprika flavoured and it turns out to be sauerkraut, you gotta bring it back to the store.


Trader Joe’s will give you your money back if you buy something and don’t like it.


I wish you could. I'd like Vernus to champion indie journalism, mainstream media is dying before our eyes anyway.



