Netgate Discussion Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Search
    • Register
    • Login

    Squid Blacklist url w/ path

    Scheduled Pinned Locked Moved pfSense Packages
    3 Posts 2 Posters 8.5k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • N
      NoahVail
      last edited by

      Hello everyone.  The most relevant information is in bold.

      **I'm trying to block the seedier parts of Craigslist, by using the Squid ProxyServer -> Access Control -> Blacklist option.

      Examples of the URLs I'm trying to block are…

      http://.craigslist.org//w4m
      http://.craigslist.org/m4w/**

      …you get the idea.

      (an actual url might be http://denver.craigslist.org/m4w/1895233724.html)

      I don't have a comprehensive understanding of Squid Expressions.

      Some examples of what I've tried are…

      craigslist.org/m4m
      .craigslist.org/m4m
      .craigslist.org/m4w/
      craigslist.orgm4w
      craigslist.org\m4w
      craigslist.org.m4w
      craigslist.org/./m4w
      craigslist.org/
      /m4w
      craigslist.org/m4w
      craigslist.org/m4w

      .craigslist.org/(m4w)/

      ...and countless other variants.

      Now I realize that my syntax choices have been incorrect, even wildly off.  But I haven't been able to figure out correct wildcard usage, when my URL contains a path.

      I've read in other posts that the '.' functions as a wildcard.  Maybe it's just for white/black listing users, I don't know.  I haven't figured out how to use it appropriately for a blacklisted URL.

      samplecity.craigslist.org/* is the same as samplecity.craigslist.org.
      So I don't really know if the '*' is having any effect on the url path.

      I know the '' works in the domain name.  The entry; samplecity..org is effective.

      **But for the life of me I can't figure out how to block http://newyork.craigslist.org/mnh/m4w/

      (without blocking all of newyork.craigslist.org/mnh, of course)

      Does someone here know what I'm doing wrong?**

      Thanks.
      NV

      Edit:Build and Ver info:
      PFSense 2.0-BETA3
      built on Wed Jul 21 01:06:25 EDT 2010
      FreeBSD 8.1-RC2

      Squid  2.7.8

      Thanx.
      NV

      1 Reply Last reply Reply Quote 0
      • jimpJ
        jimp Rebel Alliance Developer Netgate
        last edited by

        It's a regex, iirc, so for "any number of characters (or none)" use .*, not *

        So try this:

        example.com/.*/w4m

        Remember: Upvote with the 👍 button for any user/post you find to be helpful, informative, or deserving of recognition!

        Need help fast? Netgate Global Support!

        Do not Chat/PM for help!

        1 Reply Last reply Reply Quote 0
        • N
          NoahVail
          last edited by

          @jimp:

          It's a regex, iirc, so for "any number of characters (or none)" use .*, not *

          So try this:

          example.com/.*/w4m

          Thank you; I'll will try that.

          Since I posted, I was able to install squidGuard and compose a Destination List;
          by following the instructions on the squidGuard Quick Start page http://diskatel.narod.ru/sgquick.htm.

          That allowed me to selectively block just the dating and inappropriate portions of Craigslist.

          I do have Shalla's Blacklist installed, but it's 'hard coded' for just one or two Craigslist cities.

          I'll try your wildcard syntax and see if it works with a squidGuard destination list.  Right now, my Dest List uses a long set of URLs and I'll have to add another one for each additional city.  If your suggestion works there as well, the same URL set will work for every Craigslist city.  I'd forward the entries to Shalla SS; for integration into his list.

          I'll also be using your syntax, where I originally intended.  Don't know for what, but something will pop up soon enough.

          Your response is appreciated.

          NV

          1 Reply Last reply Reply Quote 0
          • First post
            Last post
          Copyright 2025 Rubicon Communications LLC (Netgate). All rights reserved.