I'm a bit thicker than usual this morning. Could you elucidate?Denny wrote:the bots will have the same issues as the lichen filter...
Use of the word "P0RN" for display of pics of inst
- Nanohedron
- Moderatorer
- Posts: 38239
- Joined: Wed Dec 18, 2002 6:00 pm
- antispam: No
- Please enter the next number in sequence: 8
- Tell us something.: Been a fluter, citternist, and uilleann piper; committed now to the way of the harp.
Oh, yeah: also a mod here, not a spammer. A matter of opinion, perhaps. - Location: Lefse country
Honey, you ain't thick, ya just ain't spent no time there.
both are software search algorithms that are looking for specific characters or a group of characters (usually with white space on either side, like - blank "p" "o" "r" "n" blank -).
bots would also be looking for .com, .net... a bit cuter is - blank BunchOfStuff "@" BunchOfSruff ".com" blank -
pBB has a bit of software that reads this junk from the box we type it into and formats it for the page (or the preview panel) C&F has added a scan for the string - blank "p" "o" "r" "n" blank - and replaces it with lichen.
notpornstring <--- no blankies
both are software search algorithms that are looking for specific characters or a group of characters (usually with white space on either side, like - blank "p" "o" "r" "n" blank -).
bots would also be looking for .com, .net... a bit cuter is - blank BunchOfStuff "@" BunchOfSruff ".com" blank -
pBB has a bit of software that reads this junk from the box we type it into and formats it for the page (or the preview panel) C&F has added a scan for the string - blank "p" "o" "r" "n" blank - and replaces it with lichen.
notpornstring <--- no blankies
- Nanohedron
- Moderatorer
- Posts: 38239
- Joined: Wed Dec 18, 2002 6:00 pm
- antispam: No
- Please enter the next number in sequence: 8
- Tell us something.: Been a fluter, citternist, and uilleann piper; committed now to the way of the harp.
Oh, yeah: also a mod here, not a spammer. A matter of opinion, perhaps. - Location: Lefse country
I'm still lost.Denny wrote:Honey, you ain't thick, ya just ain't spent no time there.
both are software search algorithms that are looking for specific characters or a group of characters (usually with white space on either side, like - blank "p" "o" "r" "n" blank -).
bots would also be looking for .com, .net... a bit cuter is - blank BunchOfStuff "@" BunchOfSruff ".com" blank -
pBB has a bit of software that reads this junk from the box we type it into and formats it for the page (or the preview panel) C&F has added a scan for the string - blank "p" "o" "r" "n" blank - and replaces it with lichen.
notpornstring <--- no blankies
I guess what I'm asking is that, in layman's terms, does the bypassing of the filter negate its function? IOW, instead of the intended "lichen", I see the other word. Does the search algorithm benefit, as it were, from this, too?
"If you take music out of this world, you will have nothing but a ball of fire." - Balochi musician
when in doubt ... test
http://chiffboard.mati.ca/viewtopic.php ... orn#626997
note: the lichen after the highlight= If I edit, and indeed when I pasted it, it is that 4 letter word.
it would appear that the search & replacement thing is run against the search criteria prior to the search being executed.
http://chiffboard.mati.ca/viewtopic.php ... orn#626997
note: the lichen after the highlight= If I edit, and indeed when I pasted it, it is that 4 letter word.
it would appear that the search & replacement thing is run against the search criteria prior to the search being executed.
- Nanohedron
- Moderatorer
- Posts: 38239
- Joined: Wed Dec 18, 2002 6:00 pm
- antispam: No
- Please enter the next number in sequence: 8
- Tell us something.: Been a fluter, citternist, and uilleann piper; committed now to the way of the harp.
Oh, yeah: also a mod here, not a spammer. A matter of opinion, perhaps. - Location: Lefse country
'Kay. I'm not getting an anwer to my question that I can understand yet, though. Sorry.Denny wrote:when in doubt ... test
http://chiffboard.mati.ca/viewtopic.php ... hen#626997
note: the lichen after the highlight= If I edit, and indeed when I pasted it, it is that 4 letter word.
it would appear that the search & replacement thing is run against the search criteria prior to the search being executed.
"If you take music out of this world, you will have nothing but a ball of fire." - Balochi musician
- kkrell
- Posts: 4838
- Joined: Mon Jul 29, 2002 6:00 pm
- antispam: No
- Please enter the next number in sequence: 8
- Tell us something.: Mostly producer of the Wooden Flute Obsession 3-volume 6-CD 7-hour set of mostly player's choice of Irish tunes, played mostly solo, on mostly wooden flutes by approximately 120 different mostly highly-rated traditional flute players & are mostly...
- Location: Los Angeles
- Contact:
I believe the point is being made that the search for a bad word (the precursor to 'lichen') looks for very specific instances of the word, and is easily beaten (for instance by substituting the number zero for the letter 'o'), as well as other common measures spammers use to get past email filters. Other services scanning the C&F forums may or may not be more sophisticated in identifying the bad word. So Google, its ads, office bad word filters, etc. might not be as easily fooled. So, C&F might be blocked from certain machines, in offices, libraries, etc. regardless of C&F administrators attempt to keep things a little more family friendly.
Kevin Krell
Kevin Krell
International Traditional Music Society, Inc.
A non-profit 501c3 charity/educational public benefit corporation
Wooden Flute Obsession CDs (3 volumes, 6 discs, 7 hours, 120 players/tracks)
https://www.worldtrad.org
A non-profit 501c3 charity/educational public benefit corporation
Wooden Flute Obsession CDs (3 volumes, 6 discs, 7 hours, 120 players/tracks)
https://www.worldtrad.org
The "filter" is only done on the way to the web page.
This is a sub task to the bit that translates the junk in the "Text area" (I's typing in one of them right now)
The tags (bold, size, color, quote, HTML ,etc.) are all parsed and changed to attributes, instead of text pictures are fetched....bunch o'stuff..."filter" is a piece if it.
This is a sub task to the bit that translates the junk in the "Text area" (I's typing in one of them right now)
The tags (bold, size, color, quote, HTML ,etc.) are all parsed and changed to attributes, instead of text pictures are fetched....bunch o'stuff..."filter" is a piece if it.
- kkrell
- Posts: 4838
- Joined: Mon Jul 29, 2002 6:00 pm
- antispam: No
- Please enter the next number in sequence: 8
- Tell us something.: Mostly producer of the Wooden Flute Obsession 3-volume 6-CD 7-hour set of mostly player's choice of Irish tunes, played mostly solo, on mostly wooden flutes by approximately 120 different mostly highly-rated traditional flute players & are mostly...
- Location: Los Angeles
- Contact:
Now I'm dense. Are you saying that the word substitution only occurs on display of the C&F thread to the person viewing it (a run-time conversion as it is viewed)? But the bad content is actually still in the database as it was typed? Thus, other functions that directly access the database of messages will still find the bad word (which might include search functions, 'bots, etc.)?Denny wrote:The "filter" is only done on the way to the web page.
This is a sub task to the bit that translates the junk in the "Text area" (I's typing in one of them right now)
The tags (bold, size, color, quote, HTML ,etc.) are all parsed and changed to attributes, instead of text pictures are fetched....bunch o'stuff..."filter" is a piece if it.
Kevin Krell
International Traditional Music Society, Inc.
A non-profit 501c3 charity/educational public benefit corporation
Wooden Flute Obsession CDs (3 volumes, 6 discs, 7 hours, 120 players/tracks)
https://www.worldtrad.org
A non-profit 501c3 charity/educational public benefit corporation
Wooden Flute Obsession CDs (3 volumes, 6 discs, 7 hours, 120 players/tracks)
https://www.worldtrad.org
I'm saying that based on how a couple of searches looked to me that the above is true.kkrell wrote:Are you saying that the word substitution only occurs on display of the C&F thread to the person viewing it (a run-time conversion as it is viewed)? But the bad content is actually still in the database as it was typed?
I do not believe that bots directly access the database. I think that they read the returned HTML, just like the browser.kkrell wrote:Thus, other functions that directly access the database of messages will still find the bad word (which might include search functions, 'bots, etc.)?
- rich
- i see what you did there
- Posts: 609
- Joined: Mon May 14, 2001 6:00 pm
- Please enter the next number in sequence: 1
- Location: Toronto, Ontario
- Contact:
(The quote talked about the moss.)Denny wrote:lichen
quote this post...
It's stored in the database as the unfiltered word because otherwise there'd be no way to make a word-filter affect previous posts without going through and changing them all when the filter is turned on, and there'd be no way to ever turn a word-filter off.
Bots and search engines don't see the database, they see the same thing you see. The only place I can think of where the unfiltered words appear is in the RSS feed, and that's a "bug" in that I think the author of the RSS feed plugin forgot to apply filtering.
- I.D.10-t
- Posts: 7660
- Joined: Wed Dec 17, 2003 9:57 am
- antispam: No
- Location: Minneapolis, MN, USA, Earth
Try doing a Google search on my exact words quoted by Nanohedron.
Google recognizes it, does their search engine work all that differently than a filter? I think that Nanohedron's initial concerns are valid. (Okay, rich answered the question with more authority, but I am a proof of concept kind of person)
Just to clarify, I believe that the automated substitution was put into place not to reduce spam bots, but because filtering software blocks sites that have bad words. As I understand it, my stunt would not allow some people to read page 2.
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
As a side note I had found the feature when trying to figure out how to explain how to post images (see below). It works to break up a complex post where normal options like "Disable BBCode in this post" just won’t work.
[img]http://chiffboard.mati.ca/images/smiles ... le_144.gif[/img]
Google recognizes it, does their search engine work all that differently than a filter? I think that Nanohedron's initial concerns are valid. (Okay, rich answered the question with more authority, but I am a proof of concept kind of person)
Just to clarify, I believe that the automated substitution was put into place not to reduce spam bots, but because filtering software blocks sites that have bad words. As I understand it, my stunt would not allow some people to read page 2.
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
As a side note I had found the feature when trying to figure out how to explain how to post images (see below). It works to break up a complex post where normal options like "Disable BBCode in this post" just won’t work.
[img]http://chiffboard.mati.ca/images/smiles ... le_144.gif[/img]
"Be not deceived by the sweet words of proverbial philosophy. Sugar of lead is a poison."