Between Two Words
A portmanteau word, or portmanteau (/pɔːrtˈmæntoʊ/ (listen), /ˌpɔːrtmænˈtoʊ/) is a blend of words in which parts of multiple words are combined into a new word, as in smog, coined by blending smoke and fog, or motel, from motor and hotel. In linguistics, a portmanteau is a single morph that is analyzed as representing two (or more) underlying morphemes. When portmanteaus shorten established compounds, they can be considered clipped compounds.
Between Two Words
A portmanteau word is similar to a contraction, but contractions are formed from words that would otherwise appear together in sequence, such as do and not to make don't, whereas a portmanteau is formed by combining two or more existing words that all relate to a single concept. A portmanteau also differs from a compound, which does not involve the truncation of parts of the stems of the blended words. For instance, starfish is a compound, not a portmanteau, of star and fish, as it includes both words in full. If it were called a "stish" or a "starsh", it would be a portmanteau.
The word portmanteau was introduced in this sense by Lewis Carroll in the book Through the Looking-Glass (1871), where Humpty Dumpty explains to Alice the coinage of unusual words used in "Jabberwocky". Slithy means "slimy and lithe" and mimsy means "miserable and flimsy". Humpty Dumpty explains to Alice the practice of combining words in various ways, comparing it to the then-common type of luggage, which opens into two equal parts:
Some city names are portmanteaus of the border regions they straddle: Texarkana spreads across the Texas-Arkansas-Louisiana border, while Calexico and Mexicali are respectively the American and Mexican sides of a single conurbation. A scientific example is a liger, which is a cross between a male lion and a female tiger (a tigon is a similar cross in which the male is a tiger).
Jeoportmanteau! is a recurring category on the American television quiz show Jeopardy! The category's name is itself a portmanteau of the words Jeopardy and portmanteau. Responses in the category are portmanteaus constructed by fitting two words together.
Portmanteau words may be produced by joining proper nouns with common nouns, such as "gerrymandering", which refers to the scheme of Massachusetts Governor Elbridge Gerry for politically contrived redistricting; the perimeter of one of the districts thereby created resembled a very curvy salamander in outline. The term gerrymander has itself contributed to portmanteau terms bjelkemander and playmander.
The word refudiate was famously used by Sarah Palin when she misspoke, conflating the words refute and repudiate. Though the word was a gaffe, it was recognized as the New Oxford American Dictionary's "Word of the Year" in 2010.
The business lexicon includes words like "advertainment" (advertising as entertainment), "advertorial" (a blurred distinction between advertising and editorial), "infotainment" (information about entertainment or itself intended to entertain by its manner of presentation), and "infomercial" (informational commercial).
Two proper names can also be used in creating a portmanteau word in reference to the partnership between people, especially in cases where both persons are well-known, or sometimes to produce epithets such as "Billary" (referring to former United States president Bill Clinton and his wife, former United States Secretary of State Hillary Clinton). In this example of recent American political history, the purpose for blending is not so much to combine the meanings of the source words but "to suggest a resemblance of one named person to the other"; the effect is often derogatory, as linguist Benjamin Zimmer states. For instance, Putler is used by critics of Vladimir Putin, merging his name with Adolf Hitler. By contrast, the public, including the media, use portmanteaus to refer to their favorite pairings as a way to "...giv[e] people an essence of who they are within the same name." This is particularly seen in cases of fictional and real-life "supercouples". An early known example, Bennifer, referred to film stars Ben Affleck and Jennifer Lopez. Other examples include Brangelina (Brad Pitt and Angelina Jolie) and TomKat (Tom Cruise and Katie Holmes). On Wednesday, 28 June 2017, The New York Times crossword included the quip, "How I wish Natalie Portman dated Jacques Cousteau, so I could call them 'Portmanteau'".
There is a tradition of linguistic purism in Icelandic, and neologisms are frequently created from pre-existing words. For example, tölva 'computer' is a portmanteau of tala 'digit, number' and völva 'oracle, seeress'.
In the Malaysian national language of Bahasa Melayu, the word jadong was constructed out of three Malay words for evil (jahat), stupid (bodoh) and arrogant (sombong) to be used on the worst kinds of community and religious leaders who mislead naive, submissive and powerless folk under their thrall.
A very common type of portmanteau in Japanese forms one word from the beginnings of two others (that is, from two back-clippings). The portion of each input word retained is usually two morae, which is tantamount to one kanji in most words written in kanji.
The inputs to the process can be native words, Sino-Japanese words, gairaigo (later borrowings), or combinations thereof. A Sino-Japanese example is the name 東大 (Tōdai) for the University of Tokyo, in full 東京大学 (Tōkyō daigaku). With borrowings, typical results are words such as パソコン (pasokon), meaning personal computer (PC), which despite being formed of English elements does not exist in English; it is a uniquely Japanese contraction of the English personal computer (パーソナルコンピュータ, pāsonaru konpyūta). Another example, Pokémon (ポケモン), is a contracted form of the English words pocket (ポケット, poketto) and monsters (モンスター, monsutā). A famous example of a blend with mixed sources is karaoke (カラオケ, karaoke), blending the Japanese word for empty (空, kara) and the Greek word orchestra (オーケストラ, ōkesutora). The Japanese fad of egg-shaped keychain pet toys from the 1990s, Tamagotchi, is a portmanteau combining the two Japanese words tamago (たまご), which means "egg", and uotchi (ウオッチ) "watch". The portmanteau can also be seen as a combination of tamago (たまご), "egg", and tomodachi (友だち), which means "friend".
Some titles also are portmanteaus, such as Hetalia (ヘタリア). It came from Hetare (ヘタレ), which means "idiot", and Italia (イタリア) which means Italy. Another example is Servamp,which came from the English words Servant (サーヴァント) and Vampire (ヴァンパイア).
Selecting Expanded or Condensed alters the spacing between all selected letters by the same amount. Kerning alters the spacing between particular pairs of letters - in some cases reducing and in other cases expanding the space depending upon the letters.
Kerning refers to the way spacing between two specific characters is adjusted. The idea is to give a better looking result by reducing the spacing between characters that fit together nicely (such as "A" and "V") and increasing the spacing between characters that don't.
Is there a way of specifying the exact space, in some given unit, between two words when the text is fully justified? I normally use \hspacex cm when I want to increase/decrease the space relative to the original spacing post-justification by x cm. But how can I specify an exact space such that the spacing between the words does not change, regardless of what justification does to the base spacing?
I will be doing Audio to Text conversion which will result in an English dictionary or non dictionary word(s) ( This could be a Person or Company name) After that, I need to compare it to a known word or words.
In information theory and computer science, the Levenshtein distance is a string metric for measuring the difference between two sequences. Informally, the Levenshtein distance between two words is the minimum number of single-character edits (i.e. insertions, deletions or substitutions) required to change one word into the other.
The algorithm to compute these distances is not cheap however. If you need to do this on a big scale there are ways to use cosine similarity on bi-gram vectors that are a lot faster and easy to distribute if you need to find matches for a lot of words at once. They are however only an approximation to this distance.
An old and well-known technique for comparison is the Soundex algorithm.The idea is to compare not the words themselves but approximations of how they are pronounced. To what extent this actually improves the quality of the results I don't know.
However it feels a bit strange to apply something like Soundex to results from a speech-to-text recognition engine. First you throw away information about how the words are pronounced, then you try to add it back again. It would be better to combine these two phases.
Hence to compute PMI, I would need joint and individual probabilities of word1 and word2. I looked at the wikipedia miner relatedness score between two words. They are implementing a Milne and Witten algorithm. However, for defining topic similarities, PMI is a better score.
The number of times (N3) both words appear in a document together. You can also set a strong constraint so that the two words appear inside a 10-word (or 20-word) window context. Similarly, Prob(word1, word2) = (N3 + 1) / N. 041b061a72