Really deep linking: Url + regex
secretGeek .:dot Nuts about dot Net:.
home .: about .: sign up .: sitemap .: secretGeek RSS

Really deep linking: Url + regex

Here's today's boil the ocean scheme idea — and it's not a new idea by any means. I'm sure it has occurred to many people at many times, I've just never seen it written down. (links to further reading welcome)

Just say you want to bookmark a particular paragraph on a particular web document.

Perhaps you want to perform the electronic equivalent of using a highlighter pen to point out a particular fragment of a document in its original context.

You can give the uri of the page, but you can't give a specific link to the actual paragraph you are interested in.

(Sure, if the author of the document provided a named anchor tag for the paragraph then you are in luck. But only semantic web fanboys and egomaniacs go to this kind of extreme)

What I'm thinking is that, instead, you could provide a little extra bit of regex-goodness in the url. Say for example you said:

href='http://secretGeek.net/regexUri.asp --highlight "Say for example you said:"'

...then this could act as a pair of instructions to the browser: the first one is "get from 'http://secretGeek.net/regexUri.asp' and the second instruction is "highlight the text that matches this regex: "Say for example you said:" (excluding the quotes). The instructions are separated by spaces, qualified by quotes and so on. Click on the link and the page is shown, with the relevant text highlighted.

Okay — i'm thinking four different things at once here:

  1. Why stop at highlight? what other commands could the new commandline accept?
  2. "Major security flaws waiting to happen"
  3. Some people, when confronted with a problem, think "I know, I'll use regular expressions." (etc...)
  4. Subtle differences in implementations between browsers.... NO!!!1!

And of course, as always, I'm hearing the voice of some strange ever-present character in my head (who sounds a lot like comic-book-store-guy from the simpsons) and he alternates between:

  • You can already do that in firefox with addins/greasemonkey, idiot. And
  • 'course, there'a an emacs command to do that

Thoughts? Additions? Subtractions? Jakob?





'Joseph Cooney' on Wed, 08 Jul 2009 09:17:49 GMT, sez:

You could, of course implement it unilaterally on your site in javascript. When it saw that kind of URL it could run a regex over the content of the page and do the highlighting of any matching elements.

Of course creating the URL in the first place would rely on end users learning regular expressions. While Jeff Friedl would probably love this to come to pass it doesn't seem likely to happen in the short to medium term.



'Dave' on Wed, 08 Jul 2009 13:18:28 GMT, sez:

A few random comments:

Since the space character has to be Url-encoded in links, you're relying on all browsers correctly splitting the Url and the "highlight" before making any requests - which is not backwards compatible with older browsers. And would result in a bunch of "href='http://secretGeek.net/regexUri.asp+--highlight+%22Say+for+example+you+said%3A%22%27"" requests and related server 404 errors.

I would propose re-using the existing fragment separator ('#') and making an additional constraint that the "highlight" is Url-encoded. That way, older browsers will just ignore it, while newer browsers can add the highlighting logic.

Of course, then we would have to add some way for those newer browsers to know what's a regular fragment and what is the new "highlight" fragment. Maybe it's treated as a regular fragment if it exists in the document, and as a highlight if the string doesn't match any element id's?



'shulmang' on Wed, 08 Jul 2009 14:17:04 GMT, sez:

while it's always nice to puzzle over a regex now and then, perhaps XPath would be good here? However, with XPath you would be specifying the markup rather than the content which may be less enduring.



'mat roberts' on Wed, 08 Jul 2009 14:53:37 GMT, sez:

I saw a related idea for permanent url linking on the horror a while ago http://www.codinghorror.com/blog/archives/000068.html
Not quite the same kind of thing your proposing...but



'Tom' on Wed, 08 Jul 2009 19:04:05 GMT, sez:

Adobe has actually had this built into their browser plugin for a little bit: http://partners.adobe.com/public/developer/en/acrobat/PDFOpenParameters.pdf

Of course, it's not always reliable and gets tripped up sometimes.



'Thom Lawrence' on Thu, 09 Jul 2009 04:33:54 GMT, sez:

There was/is an attempt to create a standard for this based on XPath:

http://www.w3.org/TR/WD-xptr



'David Andersson' on Fri, 07 Aug 2009 04:27:39 GMT, sez:

I've thought of this idea for a long time and think that properly executed it can be a hit. Myself though, I have still to figure out a good solution for it. But I often find I want to e-mail/message snippets to friends and colleagues with a highlight on the page.



'Textmarker' on Wed, 02 Sep 2009 13:56:57 GMT, sez:

Done! http://beta.marker.to/



'lb' on Wed, 02 Sep 2009 19:11:25 GMT, sez:

@TextMarker
interesting stuff.
It worked ok for me, though it didn't load any style sheets (which use relative urls) and relative links were also broke.
Plus, not seeing the original page's url anywhere was a bit disconcerting.
see http://beta.marker.to/25463
(I'd like it if there was an 'about' link at beta.marker.to ;-) I want to see who the geniuses were behind it)




name


website (optional)


enter the word:
 

comment (HTML not allowed)


All viewpoints welcome. But the right to delete any post for any reason is reserved. Don't make me do it. Aim for constructiveness. Comments may be republished, emailed to your loved ones or printed and used as toilet paper. Also, I get particularly nasty on comment spam. It's not worth even trying to post comment spam here -- your html is escaped, and your links are given a rel='nofollow'. By attempting to post a comment, you understand that if the comment is considered spam, at my absolute discretion, your IP address may be used as the target of a prolonged distributed denial of service attack. Your electricity might suddenly stop working. Your car tyres will go mysteriously flat. You will suffer permanent hairloss. Your dreams will be filled with terrifying monsters. And in any case I reserve the right to record and publish your IP address.

 

TimeSnapper is a life analysis system that stores and plays-back your computer use. It makes timesheet recording a breeze, helps you recover lost work and shows you how to sharpen your act.

 

NimbleText - FREE text manipulation and data extraction

NimbleText is a Powerful FREE Tool

Use it for:

  • extracting data from text
  • manipulating text
  • generating code

It makes you look awesome. Use it right now! Go on! Hurry! Don't walk, run!

 

Articles

Mind-boggling Demo of New Gaming Genre, aka Folder-Based Hangman, aka Fun with Recursion Mind-boggling Demo of New Gaming Genre, aka Folder-Based Hangman, aka Fun with Recursion
Got CSV in your javascript? Use agnes. Got CSV in your javascript? Use agnes.
I went to write down a book name and founded an internet empire instead. I went to write down a book name and founded an internet empire instead.
NimbleText: Origins NimbleText: Origins
The Windows 8 Mullet The Windows 8 Mullet
Cosby: spontaneous striped background generator Cosby: spontaneous striped background generator
Slides from WDCNZ: Live Coding Asp.net MVC3 Slides from WDCNZ: Live Coding Asp.net MVC3
MVC 3, MVC 3, "Third Times a Charm" references
Custom Errors in ASP.Net MVC: It couldn't be simpler, right? Custom Errors in ASP.Net MVC: It couldn't be simpler, right?
Anatomy of a Domain Hijacking, part 2: The Website Who Came In From The Cold Anatomy of a Domain Hijacking, part 2: The Website Who Came In From The Cold
Anatomy of a Domain Hijacking, part 1 Anatomy of a Domain Hijacking, part 1
secretGeek.net domain has been stolen. The site may go down. secretGeek.net domain has been stolen. The site may go down.
Boring article: 'untrusted domain' issue with SQL Server. Boring article: 'untrusted domain' issue with SQL Server.
Coding While You Commute Coding While You Commute
Test Driven Dentistry Is A Good Thing Test Driven Dentistry Is A Good Thing
The 'less crashy' release of NimbleText The 'less crashy' release of NimbleText
Rethinking Toolbars in Visual Studio (or any IDE) Rethinking Toolbars in Visual Studio (or any IDE)
Where shall we have lunch? Where shall we have lunch?
Setting up email for your microIsv Setting up email for your microIsv
The NO Visual Studio movement: Compiling .net projects in Notepad++ The NO Visual Studio movement: Compiling .net projects in Notepad++
ZeroOne: the editor for programmers who think in binary ZeroOne: the editor for programmers who think in binary
Mercurial workflow for personal projects (with a .net bias) Mercurial workflow for personal projects (with a .net bias)
I see you're using vim. Let me fix that for you. I see you're using vim. Let me fix that for you.
The worst recruitment spam I've ever read The worst recruitment spam I've ever read
A thank you I forgot to say A thank you I forgot to say
My new product, NimbleText, is live My new product, NimbleText, is live
Grabbing the free songs of Jonathan Coulton (with Powershell) Grabbing the free songs of Jonathan Coulton (with Powershell)
Using NimbleSet to compare lists Using NimbleSet to compare lists
Wanted: Wiki Lists (dot org) Wanted: Wiki Lists (dot org)
DOS on Dope: The last MVC web framework you'll ever need DOS on Dope: The last MVC web framework you'll ever need
JSON Query Languages: 5 special purpose editors JSON Query Languages: 5 special purpose editors
What then, is b? What then, is b?
SQLike: A simple editor SQLike: A simple editor
Yet Another BizPlan Generator. Yet Another BizPlan Generator.
HOT GUIDS: A hot or not site for guids HOT GUIDS: A hot or not site for guids
How does life get better? One tiny hack at a time. How does life get better? One tiny hack at a time.
24 things to do, and 100 things *not* to do (yet) for building a MicroISV 24 things to do, and 100 things *not* to do (yet) for building a MicroISV
Venture capital won't kill Jeff Atwood, it will only make him Jeffer. Venture capital won't kill Jeff Atwood, it will only make him Jeffer.
A handy workflow image for newbie mercurial users A handy workflow image for newbie mercurial users
Fractal Feedback, a diversion into recreational programming Fractal Feedback, a diversion into recreational programming
Hump-Jumping: How the Education of Computer Science can be Saved, err, maybe. Hump-Jumping: How the Education of Computer Science can be Saved, err, maybe.
Suggested User Experience Improvements for DiffMerge Suggested User Experience Improvements for DiffMerge
SQL Style Extensions for C# SQL Style Extensions for C#
The Movie Hollywood (And My Wife) Doesn't Want You To See: Weekend at Jacko's The Movie Hollywood (And My Wife) Doesn't Want You To See: Weekend at Jacko's
Sysi: the ultimate administrators toolkit Sysi: the ultimate administrators toolkit

Archives .: secretGeek :: Complete Archives
TimeSnapper -- Automated Screenshot Journal TimeSnapper.com    
Version 3.3: true productivity boost

Next Action NextAction
Managing the top of your mind

NimbleText -- World's Simplest Code GeneratorNimbleText -- World's Simplest Code Generator, Text Manipulator, Data Extractor

25 steps for building a Micro-ISV 25 steps for building a Micro-ISV
3 minute guides -- babysteps in new technologies: powershell, JSON, watir, F# 3 Minute Guide Series
Universal Troubleshooting checklist Universal Troubleshooting Checklist
Top 10 SecretGeek articles Top 10 SecretGeek articles
ShinyPower (help with Powershell) ShinyPower
Now at CodePlex

Realtime CSS Editor, in a browser RealTime Online CSS Editor
Gradient Maker -- a tool for making background images that blend from one colour to another. Forget photoshop, this is the bomb. Gradient Maker


[powered by Google] 


How to be depressed How to be depressed
You are not inadequate.



Recommended Reading


the little schemer


The Best Software Writing I
The Business Of Software (Eric Sink)

Recommended blogs

Jeff Atwood
Joseph Cooney
Phil Haack
Scott Hanselman
Julia Lerman
Rhys Parry
Joel Pobar
Thomas White
OJ Reeves
Eric Sink

Aggregated Links

proggit
dzone
hacker news
dot net kicks

Human Link Machines

interesting finds
a continuous learner's weblog
arjan's world
weekly link post

LinkedIn profile
LogEnvy - event logs made sexy
Computer, Unlocked. A rapid computer customization resource
PC Smart Buys - Computer Hardware in Australia
 
home .: about .: sign up .: sitemap .: secretGeek RSS .: © Leon Bambrick 2006 .: privacy

home .: about .: sign up .: sitemap .: RSS .: © Leon Bambrick 2006 .: privacy