php capitalize first letter
Как сделать первую букву заглавной php кирилица
Достаточно редко, но всё-же возникает такая необходимость, как сделать первую букву заглавной php кирилица.
Это когда первая буква слова становится большой. Применить это можно, например, для унификации написания имени пользователя, или, например, когда нужно автоматически составить текст в предложение.
Как сделать первую букву заглавной php латиница
Тут всё достаточно просто: в PHP есть 2 функции: ucfirst() и ucwords(). Первая делает только первую букву в строке заглавной, вторая делает первую букву каждого слова в строке заглавной.
В результате получим две строки:
First letters
и
First Letters
Видим разницу. С английскими текстами (или любыми другими), написанными латиницей проблем не будет.
Как сделать первую букву заглавной php кирилица Windows-1251 (CP-1251)
С кирилицей (русскими буквами) которые записаны в кодировки Windows-1251 тоже не будет больших проблем:
В результате получим две строки:
Первые буквы
и
Первые Буквы
Как сделать первую букву заглавной php кирилица UTF-8
Но как только дело доходит до UTF-8 начинаются проблемы, потому, что кирилица в UTF-8 занимает 2 байта, и поэтому ничего не получится. Для этого будем использовать «костыль» от Multibyte String Functions. Если этот плагин установлен на PHP, то можно просто использовать 2 аналогичные функции: mb_ucfirst и mb_convert_case.
А если их нет, то нужно дополнить код своими альтернативами:
Результатом этому коду станут такие строчки:
первые буквы
первые буквы
Первые буквы
Первые Буквы
Как видите, первые 2 строчки не сработали, из-за того, что обычные ucfirst и ucwords не умеют работать с кирилицей в UTF-8.
С помощью этих функций можно построить успешный бизнес на продвижении сайтов. Ну не только на этих функциях, но поверьте, в продвижении сайтов всегда нужно всё максимально автоматизировать.
ucfirst
(PHP 4, PHP 5, PHP 7, PHP 8)
ucfirst — Преобразует первый символ строки в верхний регистр
Описание
Принадлежность того или иного символа к буквенным определяется с учётом текущей локали. Это означает, что, например, в используемой по умолчанию локали «C», символ ä не будет преобразован.
Список параметров
Возвращаемые значения
Возвращает результирующую строку.
Примеры
Пример #1 Пример использования ucfirst()
Смотрите также
User Contributed Notes 35 notes
Simple multi-bytes ucfirst():
A proper Turkish solution;
?>
it also check is mb support enabled or not
This is what I use for converting strings to sentence case:
print sentence_case ( ‘HMM. WOW! WHAT?’ );
// Outputs: «Hmm. Wow! What?»
?>
Here’s a function to capitalize segments of a name, and put the rest into lower case. You can pass the characters you want to use as delimiters.
Implementation of multi-bytes ucfirst for «multiword»-strings (module mbstring is required):
Improved method of capitalizing first characters of sentences.
The first two manipulations (double spaces & all caps) are optional so can be removed without harm.
plemieux’ function did not work for me without passing the encoding to every single mb function (despite ini_set(‘default_charset’, ‘utf-8’) at the top of the script). This is the example that works in my application (PHP 4.3):
For some reason this worked for me.
Mac OS 10.5.1
PHP 5.2.6
Here is the fixed function for Turkish alphabet..
for anyone wanting to ucfirst each word in a sentence this works for me:
For lithuanian text with utf-8 encoding I use two functions (thanks [mattalexxpub at gmail dot com] and Svetoslav Marinov)
My version, converst first letter of the first word in the string to uppercase
public function mb_ucfirst($str) <
$aParts = explode(» «,$str);
$firstWord = mb_convert_case($aParts[0],MB_CASE_TITLE,»UTF-8″);
unset($aParts[0]);
I made a small change. Now it takes care of points in numbers
if you want to ucfirst for utf8 try this one:
( «UTF-8» );
mb_regex_encoding ( «UTF-8» );
function RemoveShouting($string)
<
$lower_exceptions = array(
«to» => «1», «a» => «1», «the» => «1», «of» => «1»
);
$higher_exceptions = array(
«I» => «1», «II» => «1», «III» => «1», «IV» => «1»,
«V» => «1», «VI» => «1», «VII» => «1», «VIII» => «1»,
«XI» => «1», «X» => «1»
);
Using this function for Turkish language is won’t work because of multi-byte characters. But you can use some tricks:
here is how mb_ucfirst should be implemented in userland
@adefoor, Ken and Zee
This is a simple code to get all the ‘bad words’, stored in a database, out of the text. You could use str_ireplace but since that’s installed on PHP5 only, this works as well. It strtolowers the text first then places capitals with ucfirst() where it thinks a capital should be placed, at a new sentence. The previous sentence is ended by ‘. ‘ then.
Ah, the last code were spoiled, here is the fixed one:
?>
So, this function changes also other letters into uppercase, ucfirst() does only change: a-z to: A-Z.
Note: the return for this function changed in versions 4.3 when a string is passed of length 0. In 4.3 a string of length 0 is returned.
Results for 4.3:
string(0) «» string(4) «Owen»
In the event you sort of need multiple delimiters to apply the same action to, you can preg_replace this «second delimiter» enveloping it with your actual delimiter.
A for instance, would be if you wanted to use something like Lee’s FormatName function in an input box designed for their full name as this script was only designed to check the last name as if it were the entire string. The problem is that you still want support for double-barreled names and you still want to be able to support the possibility that if the second part of the double-barreled name starts with «mc», that it will still be formatted correctly.
This example does a preg_replace that surrounds the separator with your actual delimiter. This is just a really quick alternative to writing some bigger fancier blah-blah function. If there’s a shorter, simpler way to do it, feel free to inform me. (Emphasis on shorter and simpler because that was the whole point of this.) 😀
Here’s the example. I’ve removed Lee’s comments as not to confuse them with my own.
ucwords
(PHP 4, PHP 5, PHP 7, PHP 8)
ucwords — Преобразует в верхний регистр первый символ каждого слова в строке
Описание
Список параметров
Необязательный параметр separators содержит символы разделителей слов.
Возвращаемые значения
Возвращает модифицированную строку.
Примеры
Пример #1 Пример использования ucwords()
Пример #2 Пример ucwords() с заданным разделителем
Пример #3 Пример использования ucwords() с дополнительными разделителями
Примечания
Замечание: Функция зависит от локали и будет обрабатывать ввод в соответствии с текущим установленным языковым стандартом. Однако она работает только с однобайтовыми наборами символов. Если вам нужно использовать многобайтовые символы (большинство языков, не входящих в Западную Европу), обратите внимание на модули multibyte или intl вместо нее.
Замечание: Эта функция безопасна для обработки данных в двоичной форме.
Смотрите также
User Contributed Notes 28 notes
My quick and dirty ucname (Upper Case Name) function.
//PRINTS:
/*
Jean-Luc Picard
Miles O’Brien
William Riker
Geordi La Forge
Beverly Crusher
*/
?>
You can add more delimiters in the for-each loop array if you want to handle more characters.
Para formatar nomes em pt-br:
Some recipes for switching between underscore and camelcase naming:
UTF-8 Title Case that works for me even with hyphens involved!
$str = ‘ĐaaČaa-AAAaaa, BBbb’;
$str = mb_convert_case($str, MB_CASE_TITLE, «UTF-8»);
echo($str): ‘Đaačaa-Aaaaaa, Bbbb’
In the function ucsmart() posted by ieure at php dot net on 04-Dec-2005 11:57, I found a similar problem in this function to what he found in igua’s.
Actually, it did end up changing the content for me (php 5.0.4) in the way that this function escapes a single quotation (apostrophe) in the MIDDLE of a word.
The fix is simple however, and merely requires fine-tuning the regular expression:
This will not escape a single quotation mark which occurs in the middle of a word. Though, you may find that might need to add other characters inside the regular expression if you use other special characters inside your words and if you get funky output.
It’s a great expression though! Simple, yet very powerful. Kudos!
Features:
— multi byte compatible
— handles multiple delimiters
Relating to the mb_ucwords() function posted by Anonymous. In order for this to actually be multi-byte compliant, you would also need to use mb_substr() and mb_strlen() instead of substr and strlen respectively.
Here it is corrected and extended even further to allow multiple word separators and a list of exceptions to correct after title casing. It’s a bit tedious and inelegant, but things frequently are when dealing with human languages.
function mb_ucwords($str) <
$exceptions = array();
$exceptions[‘Hp’] = ‘HP’;
$exceptions[‘Ibm’] = ‘IBM’;
$exceptions[‘Gb’] = ‘GB’;
$exceptions[‘Mb’] = ‘MB’;
$exceptions[‘Cd’] = ‘CD’;
$exceptions[‘Dvd’] = ‘DVD’;
$exceptions[‘Usb’] = ‘USB’;
$exceptions[‘Mm’] = ‘mm’;
$exceptions[‘Cm’] = ‘cm’;
// etc.
Turkish character with the ucwords function.
echo ucwords_tr ( ‘ŞEKardi ŞEMŞİYE ĞELENÖ ÖMER’ ); // Şekardi Şemşiye Ğelenö Ömer
echo ucwords_tr ( ‘şEKER iMSAK şÖLEN’ ); // Şeker İmsak Şölen
To avoid this i use a small function adding and deleting blanks behind these chars, and using ucwords() in between:
ucwords for UTF-8 strings:
«ieure at php dot net», your idea is pure poetry!
// captialize all first letters
$str = preg_replace(‘/\\b(\\w)/e’, ‘strtoupper(«$1»)’, strtolower(trim($str)));
ucwords for human names in Brazil.
ucwords personalizada para nomes próprios brasileiros.
Thanks a lot brother.
I tested it with a few variations. It works perfectly. Its really great and simple usage of the existing functions. It would be glad to all PHP folks and good to PHP if these kind of functions will be in PHP library on upcoming releases.
= «mark-yves robert» ;
$name2 = «mark-yves robert-bryan» ;
echo ‘
Name 2 (mark-yves robert-bryan)
?>
Proud to be a PHP enthusiast always 🙂
A very easy way to convert to title case:
function titleCase($string)
<
return ucwords(strtolower($string));
>
$myString = «SOME TEXT»;
//will print, «My Text»
Here is a function to capitalize a last name, accounting for hyphens, apostrophes, «Mc» and «Mac»:
This seems to be what people want:
[ed note: fixed the code to be correct]
The code posted above by Joerg Krause only works for a string which ends with one of the delimiters. A possible fix is:
I did the same thing as Catalin, but for French names.
Here’s what I’m doing :
This complies with the French rules for capitalization in names.
Sample results :
-d’Afoo Bar
-de Foo Bar
-O’Foo Bar
-Foo’bar
Based on code sample originally posted by «Anonymous» on 2005-12-14. The /e modifier is no longer supported by preg_replace(). Rewritten to use preg_replace_callback() instead. Tested with PHP 7.3.
// Trim whitespace and convert to lowercase
$str = strtolower(trim($string));
I have a batch of strings like so:
This can do most of what is needed but obviously also does it on iPad and iPhone :
How can I do achieve the below:
6 Answers 6
You can use str_replace for this. If you use arrays for the first two arguments, you can define a set of words and replacements:
From the documentation:
If search and replace are arrays, then str_replace() takes a value from each array and uses them to search and replace on subject.
As you know the specific words, and they are limited, why don’t you just revert them back after the total capitalization, just as following
Best practice is to call strtolower() on the input string straight away (syck’s answer doesn’t do this).
I’ll offer a pure regex solution that will appropriately target your ipad and iphone words and capitalize their seconds letter while capitalizing the first letter of all other words.
Probably the only parts to mention about the regex pattern is that \K means «restart the fullstring match» or in other words «consume and forget the previous character(s) in the current match».
Instead of having to write the lower- and uppercase version of each word you want to exclude respectively and thus having to write them twice, you could only define them once in an array and using str_ireplace instead of str_replace like this:
Which would result in
This would then replace all occurrences of these words with the version you’ve defined in the array.
Keep in mind that using this, words like «shipadvertise» would be replaced with «shiPadvertise». If you want to prevent this, you could use a more advanced solution using regular expressions:
This would then correctly resolve into
I’ve used the delimiters for determining words ucwords uses by default.
strtoupper
(PHP 4, PHP 5, PHP 7, PHP 8)
strtoupper — Преобразует строку в верхний регистр
Описание
Принадлежность того или иного символа к буквенным определяется с учётом текущей локали. Это означает, что, например, в используемой по умолчанию локали «C», символ ä не будет преобразован.
Список параметров
Возвращаемые значения
Возвращает строку в верхнем регистре.
Примеры
Пример #1 Пример использования strtoupper()
Примечания
Замечание: Эта функция безопасна для обработки данных в двоичной форме.
Смотрите также
User Contributed Notes 16 notes
One might think that setting the correct locale would do the trick with for example german umlauts, but this is not the case. You have to use mb_strtoupper() instead:
Here is how to make the character in upper case, except HTML-entities:
If you only need to extend the conversion by the characters of a certain language, it’s possible to control this using an environment variable to change the locale:
When using UTF-8 and need to convert to uppercase with
special characters like the german ä,ö,ü (didn’t test for french,polish,russian but think it should work, too) try this:
If you can’t find an appropriate locale setting, check your system configuration (locales are a system-wide setting, PHP gets them from the OS). On Windows, locales can be set from the Control Panel; on Linux it depends on your distribution. You can try «sudo dpkg-reconfigure locales» on Debian-based distros, or configure them manually. On Ubuntu Dapper, I had to copy entries over from /usr/share/i18n/SUPPORTED to /var/lib/locales/supported.d/local, then do the dpkg-reconfigure.
After you’re done, restart the web server.
That said, there are special cases where you want to do the conversion manually. In German, for example, the letter ‘ß’ (szlig) only exists as a lower-case character, and so doesn’t get converted by strtoupper. The convential way to express a ‘ß’ in an uppercase string is «SS». This function will take care of this exception (for Latin1 and most of Latin9, at least):