www.webdeveloper.com
Results 1 to 6 of 6

Thread: Regex for validating full URL

  1. #1
    Join Date
    Sep 2004
    Location
    Sri Lanka
    Posts
    486

    Question Regex for validating full URL

    Hi,

    I need to have a JS function which can validate full URLs against following conditions.

    1. Should contain either http:// or https:// (other protocols not relevant)
    2. May or may not have www
    3. Might end with domain tld
    4. Might end with a /
    5. Might end with a file name
    6. Might end with URL parameters
    7. Should support sub level folders (http://www.site.com/folder1/folder2/ , http://www.site.com/folder1/folder3/folder3/1.html) and file names if any

    I do not have a clear understanding on regex and examples I found from web doesnt seem to fit with my need.

    Any help on the matter is highly appreciated.

    Thank you.

  2. #2
    Join Date
    Sep 2004
    Location
    Sri Lanka
    Posts
    486
    Still with no solution

  3. #3
    Join Date
    Oct 2010
    Location
    Versailles, France
    Posts
    1,290
    It's easy to write the beginning of the regular expression with something like this
    Code:
    var rgx=/^https?:\/\/.*/;
    which capture all url beginning with http:// or https:// , but the other points do not constitute real conditions...
    Besides, there is a lot of top level domain (see this IANA page) which supposes probably a preliminary choice...

    An example with some top level domains
    Code:
    var urls="http://www.google.org/,https://www.google.us/,https://www.google.fr/,http://hp-comm.fr.msn.com/?ocid=iefvrt".split(',');
    var rgx=/(?=.*(org|com|eu|us|fr))^https?:\/\/.*/;
    
    for (var i=0;i<urls.length;i++) alert('RegExp : '+rgx+'\n\nUrl : '+urls[i]+'\nTest :'+rgx.test(urls[i]));
    Last edited by 007Julien; 10-03-2013 at 10:23 AM.

  4. #4
    Join Date
    Sep 2004
    Location
    Sri Lanka
    Posts
    486
    Hi Julien,

    Thank you very much for your reply.

    What if we take TLD part is just a string starting from a dot and having at least 2 characters (e.g. .lk , .pk , .tk domains) and not less than 5 (longest tld known to me is .travel )

    Best Regards

  5. #5
    Join Date
    Oct 2010
    Location
    Versailles, France
    Posts
    1,290
    Try this (with a charset="utf-8") :
    Code:
    var urls="http://www.google.org/,https://www.google.gb/,https://www.google.fr/,http://hp-comm.fr.msn.zw/".split(',');
    var rgx=/(?=.*\.(ad|ae|aero|af|ag|ai|al|am|an|ao|aq|ar|arpa|as|asia|at|au|aw|ax|az|ba|bb|bd|be|bf|bg|bh|bi|biz|bj|bl|bm|bn|bo|bq|br|bs|bt|bv|bw|by|bz|ca|cat|cc|cd|cf|cg|ch|ci|ck|cl|cm|cn|co|com|coop|cr|cu|cv|cw|cx|cy|cz|de|dj|dk|dm|do|dz|ec|edu|ee|eg|eh|er|es|et|eu|fi|fj|fk|fm|fo|fr|ga|gb|gd|ge|gf|gg|gh|gi|gl|gm|gn|gov|gp|gq|gr|gs|gt|gu|gw|gy|hk|hm|hn|hr|ht|hu|id|ie|il|im|in|info|int|io|iq|ir|is|it|je|jm|jo|jobs|jp|ke|kg|kh|ki|km|kn|kp|kr|kw|ky|kz|la|lb|lc|li|lk|lr|ls|lt|lu|lv|ly|ma|mc|md|me|mf|mg|mh|mil|mk|ml|mm|mn|mo|mobi|mp|mq|mr|ms|mt|mu|museum|mv|mw|mx|my|mz|na|name|nc|ne|net|nf|ng|ni|nl|no|np|nr|nu|nz|om|org|pa|pe|pf|pg|ph|pk|pl|pm|pn|post|pr|pro|ps|pt|pw|py|qa|re|ro|rs|ru|rw|sa|sb|sc|sd|se|sg|sh|si|sj|sk|sl|sm|sn|so|sr|ss|st|su|sv|sx|sy|sz|tc|td|tel|tf|tg|th|tj|tk|tl|tm|tn|to|tp|tr|travel|tt|tv|tw|tz|ua|ug|uk|um|us|uy|uz|va|vc|ve|vg|vi|vn|vu|wf|ws|испытание|қаз|срб|테스트|中国|中國|測試|укр|香港|δοκιμή|إختبار|台湾|台灣|мон|الجزائر|عمان|ایران|امارات|پاکستان|الاردن|بھارت|المغرب|السعودية|سودان|مليسيا|xxx|ye|yt|za|zm|zw))^https?:\/\/.*/;
    
    for (var i=0;i<urls.length;i++) alert('regExp : '+rgx+'\n\n url : '+urls[i]+'\ntest :'+rgx.test(urls[i]));
    Which is only an extract of the IANA list (I remove some strange domains amending the code of the next vertical bar).
    Good luck !

    NB : Since the assertion sub-pattern for a point and at least 2 characters and not less than 6 characters (like museum and travel ) is : (?=.*\.[a-z]{2,6}). We search and not capture (?=) one or several (the star *) character of any type (the dot .) followed by a dot (with a backslash \.) and from 2 to 6 letters.
    Last edited by 007Julien; 10-03-2013 at 12:12 PM.

  6. #6
    Join Date
    Sep 2004
    Location
    Sri Lanka
    Posts
    486

    Smile

    Thanks a lot

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
HTML5 Development Center

"

"

X vBulletin 4.2.2 Debug Information

  • Page Generation 0.15572 seconds
  • Memory Usage 2,897KB
  • Queries Executed 13 (?)
More Information
Template Usage (33):
  • (1)SHOWTHREAD
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_global_above_footer
  • (1)ad_global_below_navbar
  • (1)ad_global_header1
  • (1)ad_global_header2
  • (1)ad_navbar_below
  • (1)ad_showthread_firstpost_sig
  • (1)ad_showthread_firstpost_start
  • (1)ad_thread_first_post_content
  • (1)ad_thread_last_post_content
  • (3)bbcode_code
  • (1)footer
  • (1)forumjump
  • (1)forumrules
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (1)headinclude_bottom
  • (6)memberaction_dropdown
  • (1)navbar
  • (4)navbar_link
  • (1)navbar_moderation
  • (1)navbar_noticebit
  • (1)navbar_tabs
  • (2)option
  • (6)postbit
  • (6)postbit_onlinestatus
  • (6)postbit_wrapper
  • (1)spacer_close
  • (1)spacer_open
  • (1)tagbit_wrapper 

Phrase Groups Available (6):
  • global
  • inlinemod
  • postbit
  • posting
  • reputationlevel
  • showthread
Included Files (26):
  • ./showthread.php
  • ./global.php
  • ./includes/class_bootstrap.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/functions_navigation.php
  • ./includes/class_friendly_url.php
  • ./includes/class_hook.php
  • ./includes/class_bootstrap_framework.php
  • ./vb/vb.php
  • ./vb/phrase.php
  • ./includes/functions_facebook.php
  • ./includes/functions_calendar.php
  • ./includes/functions_bigthree.php
  • ./includes/class_postbit.php
  • ./includes/class_bbcode.php
  • ./includes/functions_reputation.php
  • ./includes/functions_notice.php
  • ./packages/vbattach/attach.php
  • ./vb/types.php
  • ./vb/cache.php
  • ./vb/cache/db.php
  • ./vb/cache/observer/db.php
  • ./vb/cache/observer.php 

Hooks Called (70):
  • init_startup
  • friendlyurl_resolve_class
  • init_startup_session_setup_start
  • database_pre_fetch_array
  • database_post_fetch_array
  • init_startup_session_setup_complete
  • global_bootstrap_init_start
  • global_bootstrap_init_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • load_show_variables
  • load_forum_show_variables
  • global_state_check
  • global_bootstrap_complete
  • global_start
  • style_fetch
  • global_setup_complete
  • showthread_start
  • showthread_getinfo
  • strip_bbcode
  • friendlyurl_clean_fragment
  • friendlyurl_geturl
  • forumjump
  • cache_templates
  • cache_templates_process
  • template_register_var
  • template_render_output
  • fetch_template_start
  • fetch_template_complete
  • parse_templates
  • fetch_musername
  • notices_check_start
  • notices_noticebit
  • process_templates_complete
  • friendlyurl_redirect_canonical
  • showthread_post_start
  • showthread_query_postids
  • showthread_query
  • bbcode_fetch_tags
  • bbcode_create
  • showthread_postbit_create
  • postbit_factory
  • postbit_display_start
  • bbcode_parse_start
  • postbit_imicons
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • postbit_display_complete
  • memberaction_dropdown
  • tag_fetchbit_complete
  • forumrules
  • navbits
  • navbits_complete
  • build_navigation_data
  • build_navigation_array
  • check_navigation_permission
  • process_navigation_links_start
  • process_navigation_links_complete
  • set_navigation_menu_element
  • build_navigation_menudata
  • build_navigation_listdata
  • build_navigation_list
  • set_navigation_tab_main
  • set_navigation_tab_fallback
  • navigation_tab_complete
  • fb_like_button
  • showthread_complete
  • page_templates