Sharepoint 2007 search not work for specific words "lånelöfte" - sharepoint-2007

We are using SharePoint 2007 default search, somehow it is not working for specific words such as " lånelöfte, Larmpaket, Vattenfelsbrytare". But If search for words such as "lanelöfte", "länelöfte" then it search properly and also displays result for word "lånelöfte".
We added this words in noise file, also set diacritic sensitive to true.

Related

Concatenating multiple hyperlinks in one cell in Google Sheets

Now that it is possible to have multiple links within the same Google Sheets cell, I'm trying to automate some grunt work with a Mail Merge add-on. The add-on takes multiple "attachments" that are hyperlinks separated by commas and does a great job of mailing them out if I copy and paste the links in manually.
I'm using this to build individual unique emails of tests for my students so I can type in the sheets Questions A, B and Q for student 1, Questions B, C and Q for student 2. I use VLOOKUP to grab the links from a separate page that replaces Question A with the hyperlink for Question A, etc. Then I used CONCATENTATE with commas and spaces between links to build the cell that has the 3 "attachments" in it.
But it only sees the text and doesn't turn it into an active hyperlink.
If I make a copy of that CONCATENATEd text and paste as value, then go in and press the space bar after the commas, the links turn blue or "live", and then it works.
I would love to automate this. Any suggestions?
I'd be interested in automating this as well but it seems unavailable at the moment. One possible workaround (albeit a less elegant one) is to use the plain URL itself (instead of hiding it in a hyperlinked text) and separate the concatenated links by a comma or a carriage return. Gmail tends to auto-convert plain URLs into hyperlinked text. You could try a test email to yourself via the above method to ensure the concatenated URLs are converted to clickable links.

JQL actual "contains"

I want to perform a simple search on a text field with part of its content but I don't know the beginning. I basically want what someone would expect of a "contains search". If I search issue for 345, I would want this result:
123456
234567
345678
...
Which, in JQL, would be the result of the query issue ~ "*345*", but the * is not allowed as first character in wildcard query. Is there an easy way to get this result, preferably with a JQL query?
Right now it's impossible to search JIRA for contains operation. As described in Search syntax for text fields, JIRA support Word stemming:
Since JIRA cannot search for issues containing parts of words, word
'stemming' allows you to retrieve issues from a search based on the
'root' (or 'stem') forms of words instead of requiring an exact match
with specific forms of these words. The number of issues retrieved
from a search based on a stemmed word is typically larger, since any
other issues containing words that are stemmed back to the same root
will also be retrieved in the search results.
That means, that you can search for common root of some word, but can't search for arbitrary part of it.
There is an issue in official JIRA bug tracker: Allow searching for part of a word (prefix / substring searches), which describes why this can't be implemented:
Lucene doesn't support prefix search.
As a workaround the suggestion is to use Script Runner plugin for JIRA:
issueFunction in issueFieldMatch("project = JRA", "description", "ABC\\d{4}")
See more on IssueFieldMatch here.
Another plugin, which can do regex jql is JQL Search Toolkit.
Filter issues by "345" substring in the Summary field:
summary ~ "345"
Filter issues by "345" substring in the Description field:
description ~ "345"

What would be the best search engine design for multi language search?

I have a database in which I store over 3 million documents with titles in different languages.
Each document has the following (simplified) structure:
{name: "The Intouchables",
detail: {
original_title: "Intouchables"
spanish_title: "Intocable"
}
}
My users search either in Spanish or English. Text index feature in Mongodb enables you to specify the language for each document and a default language. Having this into account, how would you design a great search engine that:
Searches fast (I would like to incorporate autocompletion soon) for titles
Is accurate
User can search in English or Spanish
For the time being, I would like to adjust to what Mongodb brings to the table, but I'm open to other technologies if they really change the game (Redis, Elasticsearch, etc.)
Some work I've already done:
I have indexed all my documents with default_language "none". That's inefficient due to the great amount of potential stop words that are stored. If I set default_language as English or Spanish, results are not accurate due to stop-word matching (it gives irrelevant results: for example, giving good score for titles with word "The"...a lot).
Some ideas:
Using mongoid_search (keywords based on specified fields) and make _keywords field text indexed.
Specify language override for titles in spanish. Make both English and Spanish search (two queries) and intersect results (not big fan).

Google Search Appliance wildcard character support

Does Google Search Appliance support wildcard characters in the query string. If no, is there any way I can fetch all the result set through my query?
The GSA does not support wildcarding. An option can be toN-Gram the fields or content that you want wildcarded. This would be achieved in your feeder or pipeline.
If waiting and upgrading the gsa software to v 7.2,.coming mid December is an option you will have wild card search built in.
Otherwise you have to dig deeper. A possible option is a document filter. If you are interested in that option I might be able to help.
I have developed such a document filter.
GSA software 7.4 has wildcard search built in. From documentation:
Enabling Wildcard Search
Wildcard search is a feature that enables your users to search by entering a word pattern rather than the exact spelling of a term. The search appliance supports two wildcard operators:
*--Matches zero or more characters
?--Matches exactly 1 character
Using wildcards can simplify queries for long names, technical data, pharmaceutical information, or strings where the exact spelling varies or is unknown. A user can search for all words starting with a particular pattern, ending with a particular pattern, or having a particular substring pattern.
By default, wildcard indexing is disabled for your search appliance. You can enable or disable wildcard indexing by using the Index > Index Settings page. You can disable or enable wildcard search for one or more front ends by using the Filters tab of the Search > Search Features > Front Ends page.
One way to get all indexed items from a collection is to use a query that will match every indexed record, e.g., supposing you're indexing some set of URLs from subdomain.companyname.com, just query for "companyname", with the "&num=1000&filter=0" query string parameters.

Represent the search result by adding relevant description

I'm developing simple search engine.If I search some thing using my search engine it will produce the list of urls which are relating with that search query.
I want to represent the search result by giving small,relevant description under each resulting url.(eg:- if we search something on google,you can see they will provide small description with the each resulting link.)
Any idea..?
Thank in advance!
You need to store position of each word in a webpage while indexing.
your index should contain- word id , document id of the document containing this word, number of occurrence of the word in that document , all the positions where the word occurred.
For more info you can read the research paper by Google founders-
The Anatomy of a Large-Scale Hypertextual Web Search Engine
You can fetch the meta content of that page and display it as a small description . Google also does this.

Resources