Are GET variables used in Google's search engine? [closed] - search-engine

Closed. This question is off-topic. It is not currently accepting answers.
Want to improve this question? Update the question so it's on-topic for Stack Overflow.
Closed 10 years ago.
Improve this question
Say i have a site with movies. Now each movie doesn't have each own page, but passing the id or the name of the movie to one specific page, i.e movies.php, the data from the database are loaded in that page..
Now if i search in google a specific movie will my site be listed with that movie in the results??
In other words, if i have that movie stored in my database, but don't have an individual page for it, will it appear in search engines just by getting the GET variable, i.e $movie_name, passed in the movies.php??
Thank you!

If each of your movies has its own page, i.e. movies.com/movies.php?movie=half_baked
then that will count as a unique page.
You need to give Google a way of differentiating between different pages.
you can search google for your website by typing site:yoursite.com. That will show you each page google has indexed.
look into canonical pages http://support.google.com/webmasters/bin/answer.py?hl=en&answer=139394

Related

Solution for voting without attachment to a model? [closed]

Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 8 years ago.
Improve this question
My application is one page contract. I would like users to be able to sign the contract by clicking on a button. The contract is not a resource; it's plain HTML.
Every solution I have found so far relies on having a model that acts as votable. How can I implement a simple button that users may only click once, and display the number of users who have clicked it?
It's not possible to do this with static pages, at least not in a way that is clean and secure.
Think about it this way: every user is looking at a copy of the contract, which is being displayed to them on their browser (the client). If you want users to be able to cast votes that persist and be aware of votes cast by other users, then you need a server that keeps track of it centrally. That's why the solutions you have found so far rely on having a model, presumably backed with a table.

simple_html_dom to crawl entire website [closed]

Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 8 years ago.
Improve this question
I want to crawl an entire website. I am using Simple_html_dom for parsing but the problem is that it takes only one webpage link at a time. I want to provide only start (home page) link and it should crawl and parse all the web pages of that website automatically. Any suggestion how to do this ?
When parsing the DOM of that single page, store all links (within the same domain) in an array. Then, at the end of parsing, check if the array isn't empty. If it isn't, take the first link and do the same.
So something like (code sample written with Python-like syntax, but you can adapt it to PHP easily - mine is rusty).
referenced_links = ['your_initial_page.html']
while referenced_links: # if the array isn't empty...
crawl_dom(referenced_links[0])
referenced_links.pop(0) # remove the first item in that array
def crawl_dom(url):
# download the url, parse the DOM and append all hyperlinks to the array referenced_links

In solr how do you maximize variability within a page of a result set? [closed]

Closed. This question needs details or clarity. It is not currently accepting answers.
Want to improve this question? Add details and clarify the problem by editing this post.
Closed 8 years ago.
Improve this question
This is for a product search where we want to show a variety of brands in the search results (24 per page). Ideally we would limit it to each item on the page must have a unique brand id, unless given the page in the result set, that is impossible.
Any ideas on how to accomplish this?
what a great question! There's no simple answer, but you might want to try using the grouping functionality that Solr provides. This would you give you one result per brand id that has matching results. If you don't get enough results from that, you could fall back to the "regular" search results. But I don't know of any way to distribute results evenly throughout your brands, because Solr results are ordered using a scoring mechanism where each result is scored independently. Perhaps you could write a Scorer with some memory that would down-weight subsequent matches from previously-seen brands?

what is the pros/cons to have unique&unpredictable id in url [closed]

Closed. This question is off-topic. It is not currently accepting answers.
Want to improve this question? Update the question so it's on-topic for Stack Overflow.
Closed 9 years ago.
Improve this question
I'm considering to use url pattern like below:
example.com/item/r6B0PmUmx07O/just-one-item
example.com/item/r6B0PGgwPJWl/yet-another-item
the part before slug is an unique and unpredictable id for an item.
compare with url like
example.com/item/1001/just-one-item
example.com/item/1002/yet-another-item
is this way bad for SEO?
or will it be bad for crawling by the search engine?(since the crawler cannot 'guess' the next item's id)
I'm not sure how many popular crawlers try to increment number values in URL to hit the page.
They generaly try to traverse by links.
But consider hiding some info from malicious users. If you can reach any info about your users (by example.com/user/1001) there is generally wrong idea to have sequential UID's. It's not mean to be a part of security but sometimes it's good to difficult access to your data. So the competition will have some difficulties when guessing how much products you have on stock :)
Consider supplying dynamical sitemap with links to all your products. This make you sure that every crawler will hit all your items no matter what key it has.

Google showing website inner search engine results, how does it work? [closed]

Closed. This question is off-topic. It is not currently accepting answers.
Want to improve this question? Update the question so it's on-topic for Stack Overflow.
Closed 11 years ago.
Improve this question
Sometimes when I search in Google, appears a website and just below... several links from that website. Sometimes also (I don't know if it's related) I click on a result and the website shows me the page with the search terms highlighted.
How does that work? I mean, which technology or standard do I have to implement in my website in order to archieve those effects?
Thanks
Do you mean sitelinks?
Google's systems analyse the link structure of your site to find shortcuts that will save users time and allow them to quickly find the information they're looking for.
They only show sitelinks for results when they think they'll be useful to the user.
You can read more here http://www.google.com/support/webmasters/bin/answer.py?hl=en&answer=47334
EDIT
To answer your question, all you can do is make sure you have a well formed site with clear navigation which your users can use and find useful. If your site popular then Google will do the rest.
Google shows highlighted search terms on its cached pages
When you're viewing a cached result the page is stored on Google's servers - so they can modify as they wish (highlighting search terms).
If you're viewing them on actual websites this is due to either:
Google wrapper around the page (such as mobile viewing)
Google toolbar (or similar)

Resources