TY - JOUR 
A1 - Pérez&#x20;Martínez,&#x20;Luis
T1 - Object&#x20;online&#x20;preclassifier&#x20;for&#x20;video&#x20;streams&#x20;using&#x20;GMM

Y1 - 2020
UR - http:&#x2F;&#x2F;hdl.handle.net&#x2F;10317&#x2F;9031
AB - [SPA]&#x20;Cada&#x20;día&#x20;hay&#x20;disponibles&#x20;más&#x20;y&#x20;más&#x20;datos&#x20;y&#x20;esta&#x20;tendencia&#x20;no&#x20;para&#x20;de&#x20;crecer.&#x0D;&#x0A;Por&#x20;ponerlo&#x20;en&#x20;contexto,&#x20;en&#x20;2019&#x20;cada&#x20;minuto&#x20;son&#x20;subidas&#x20;a&#x20;YouTube&#x20;más&#x20;de&#x0D;&#x0A;400&#x20;horas&#x20;de&#x20;video.&#x20;El&#x20;99,5%&#x20;de&#x20;los&#x20;datos&#x20;generados&#x20;no&#x20;se&#x20;analizan&#x20;debido&#x20;a&#x0D;&#x0A;falta&#x20;de&#x20;recursos&#x20;[1].&#x20;Toda&#x20;esta&#x20;información&#x20;se&#x20;hace&#x20;imposible&#x20;de&#x20;analizar&#x20;y&#x0D;&#x0A;visualizar&#x20;manualmente,&#x20;por&#x20;lo&#x20;que&#x20;es&#x20;necesario&#x20;estudiar&#x20;y&#x20;emplear&#x20;nuevas&#x0D;&#x0A;técnicas&#x20;de&#x20;análisis&#x20;de&#x20;información&#x20;y&#x20;videos.&#x0D;&#x0A;El&#x20;objetivo&#x20;de&#x20;este&#x20;proyecto&#x20;es&#x20;estudiar&#x20;la&#x20;viabilidad&#x20;de&#x20;aplicar&#x20;técnicas&#x20;de&#x0D;&#x0A;inteligencia&#x20;artificial&#x20;para&#x20;obtener,&#x20;analizar&#x20;y&#x20;clasificar&#x20;información&#x20;extraída&#x20;de&#x0D;&#x0A;videos&#x20;de&#x20;modo&#x20;desatendido.&#x20;Es&#x20;decir,&#x20;sin&#x20;partir&#x20;de&#x20;unos&#x20;datos&#x20;etiquetados&#x0D;&#x0A;previos.&#x0D;&#x0A;En&#x20;concreto,&#x20;nuestro&#x20;objetivo&#x20;es&#x20;ser&#x20;capaz&#x20;de&#x20;identificar&#x20;y&#x20;agrupar&#x20;objetos&#x20;en&#x0D;&#x0A;movimiento&#x20;en&#x20;un&#x20;video,&#x20;y&#x20;todo&#x20;ello&#x20;del&#x20;modo&#x20;más&#x20;simple&#x20;posible&#x20;y&#x20;con&#x20;la&#x0D;&#x0A;mínima&#x20;intervención&#x20;humana.&#x0D;&#x0A;Para&#x20;la&#x20;realización&#x20;del&#x20;estudio&#x20;hemos&#x20;utilizado&#x20;videos&#x20;disponibles&#x20;online.&#x20;En&#x0D;&#x0A;este&#x20;documento&#x20;se&#x20;recoge&#x20;en&#x20;concreto&#x20;la&#x20;aplicación&#x20;para&#x20;detectar&#x20;peces&#x20;en&#x20;un&#x0D;&#x0A;acuario&#x20;y&#x20;diferenciar&#x20;cuántas&#x20;especies&#x20;distintas&#x20;hay.&#x20;Ésta&#x20;es&#x20;simplemente&#x20;una&#x0D;&#x0A;aplicación&#x20;demostrativa,&#x20;pero&#x20;que&#x20;ilustra&#x20;potencialmente&#x20;los&#x20;pasos&#x20;de&#x0D;&#x0A;resolución&#x20;para&#x20;otro&#x20;tipo&#x20;de&#x20;problemas&#x20;similares.&#x0D;&#x0A;En&#x20;un&#x20;primer&#x20;paso&#x20;hemos&#x20;detectado&#x20;objetos&#x20;en&#x20;movimiento&#x20;extrayendo&#x20;un&#x0D;&#x0A;modelo&#x20;del&#x20;background&#x20;a&#x20;partir&#x20;del&#x20;video.&#x20;Los&#x20;objetos&#x20;en&#x20;primer&#x20;plano&#x20;son&#x0D;&#x0A;aquellos&#x20;que&#x20;se&#x20;mueven&#x20;entre&#x20;frames,&#x20;y&#x20;pueden&#x20;ser&#x20;peces,&#x20;plantas&#x20;u&#x20;otros&#x0D;&#x0A;objetos&#x20;móviles.&#x20;En&#x20;un&#x20;segundo&#x20;paso&#x20;estos&#x20;objetos&#x20;se&#x20;uniformizan&#x20;a&#x20;un&#x20;tamaño&#x0D;&#x0A;fijo&#x20;y&#x20;como&#x20;imágenes&#x20;en&#x20;escala&#x20;de&#x20;grises.&#x20;Tras&#x20;ello&#x20;se&#x20;compara&#x20;el&#x20;objeto&#x20;con&#x20;un&#x0D;&#x0A;modelo&#x20;Gaussiano&#x20;multidimensional&#x20;creado&#x20;a&#x20;partir&#x20;de&#x20;un&#x20;grupo&#x20;de&#x20;imágenes&#x0D;&#x0A;de&#x20;peces.&#x20;Obsérvese&#x20;que&#x20;este&#x20;paso&#x20;requiere&#x20;intervención&#x20;humana,&#x20;pero&#x20;es&#x20;más&#x0D;&#x0A;simple&#x20;que&#x20;un&#x20;modelo&#x20;supervisado&#x20;donde&#x20;se&#x20;requieren&#x20;también&#x20;etiquetar&#x0D;&#x0A;imágenes&#x20;negativas.&#x20;Por&#x20;último,&#x20;aquellas&#x20;imágenes&#x20;que&#x20;superan&#x20;este&#x20;test&#x20;son&#x0D;&#x0A;agrupadas&#x20;con&#x20;un&#x20;modelo&#x20;de&#x20;mixturas&#x20;gaussianas&#x20;(GMM),&#x20;usando&#x20;el&#x20;método&#x0D;&#x0A;del&#x20;codo&#x20;para&#x20;determinar&#x20;el&#x20;número&#x20;total&#x20;de&#x20;clusters&#x20;(especies).&#x20;Todo&#x20;este&#x0D;&#x0A;algoritmo&#x20;se&#x20;ha&#x20;desarrollado&#x20;mediante&#x20;Python&#x20;con&#x20;las&#x20;librerías&#x20;de&#x20;OpenCV&#x20;y&#x0D;&#x0A;Keras&#x2F;TF.&#x0D;&#x0A;Para&#x20;concluir&#x20;se&#x20;puede&#x20;afirmar&#x20;que&#x20;con&#x20;la&#x20;tecnología&#x20;actual&#x20;y&#x20;utilizando&#x20;las&#x0D;&#x0A;capacidades&#x20;del&#x20;lenguaje&#x20;de&#x20;programación&#x20;Python&#x20;y&#x20;las&#x20;librerías&#x20;disponibles&#x20;de&#x0D;&#x0A;inteligencia&#x20;artificial&#x20;es&#x20;posible&#x20;realizar&#x20;un&#x20;sistema&#x20;que&#x20;de&#x20;forma&#x20;semiautónoma&#x0D;&#x0A;pueda&#x20;realizar&#x20;un&#x20;análisis&#x20;de&#x20;la&#x20;información&#x20;contenido&#x20;en&#x20;videos&#x20;o&#x20;imágenes.&#x0D;&#x0A;Palabras&#x20;clave:&#x20;Big&#x20;data,&#x20;Inteligencia&#x20;artificial,&#x20;Python,&#x20;GMM,&#x20;Autoencoder,&#x0D;&#x0A;OpenCV,&#x20;Clasificación,&#x20;Aprendizaje&#x20;Supervisado.&#x0D;&#x0A;[ENG]&#x20;Every&#x20;day&#x20;more&#x20;and&#x20;more&#x20;data&#x20;are&#x20;generated,&#x20;this&#x20;trend&#x20;does&#x20;not&#x20;stop&#x20;growing&#x0D;&#x0A;up.&#x20;In&#x20;2019,&#x20;more&#x20;than&#x20;400&#x20;hours&#x20;of&#x20;video&#x20;are&#x20;uploaded&#x20;to&#x20;YouTube&#x20;every&#x0D;&#x0A;minute.&#x20;99.5%&#x20;of&#x20;the&#x20;data&#x20;generated&#x20;are&#x20;not&#x20;analysed&#x20;due&#x20;to&#x20;lack&#x20;of&#x20;resources&#x0D;&#x0A;or&#x20;techniques[1].&#x20;All&#x20;this&#x20;information&#x20;becomes&#x20;impossible&#x20;to&#x20;analyse&#x20;and&#x0D;&#x0A;visualize&#x20;by&#x20;hand.&#x20;Therefore,&#x20;is&#x20;necessary&#x20;an&#x20;evolution&#x20;of&#x20;information&#x20;and&#x20;video&#x0D;&#x0A;analysis&#x20;techniques&#x20;to&#x20;apply.&#x0D;&#x0A;The&#x20;objective&#x20;of&#x20;this&#x20;project&#x20;is&#x20;to&#x20;study&#x20;the&#x20;viability&#x20;of&#x20;apply&#x20;artificial&#x20;intelligence&#x0D;&#x0A;techniques&#x20;to&#x20;obtain,&#x20;analyse&#x20;and&#x20;classify&#x20;information&#x20;extracted&#x20;from&#x20;videos.&#x0D;&#x0A;To&#x20;carry&#x20;out&#x20;the&#x20;study&#x20;we&#x20;have&#x20;been&#x20;used&#x20;online&#x20;videos&#x20;available.&#x20;This&#x20;document&#x0D;&#x0A;specifically&#x20;collects&#x20;the&#x20;application&#x20;to&#x20;detect&#x20;fish&#x20;in&#x20;an&#x20;aquarium&#x20;and&#x0D;&#x0A;differentiate&#x20;how&#x20;many&#x20;different&#x20;species&#x20;there&#x20;are.&#x20;This&#x20;is&#x20;a&#x20;simple&#x0D;&#x0A;demonstration&#x20;application,&#x20;but&#x20;potentially&#x20;illustrates&#x20;the&#x20;resolution&#x20;steps&#x20;for&#x0D;&#x0A;other&#x20;similar&#x20;types&#x20;of&#x20;problems.&#x0D;&#x0A;In&#x20;a&#x20;first&#x20;step&#x20;we&#x20;have&#x20;detected&#x20;moving&#x20;objects&#x20;by&#x20;extracting&#x20;a&#x20;model&#x20;of&#x20;the&#x0D;&#x0A;background&#x20;from&#x20;the&#x20;video.&#x20;The&#x20;objects&#x20;in&#x20;the&#x20;foreground&#x20;are&#x20;those&#x20;that&#x20;move&#x0D;&#x0A;between&#x20;frames,&#x20;and&#x20;can&#x20;be&#x20;fish,&#x20;plants&#x20;or&#x20;other&#x20;moving&#x20;objects.&#x20;In&#x20;a&#x20;second&#x0D;&#x0A;step&#x20;these&#x20;objects&#x20;are&#x20;standardized&#x20;to&#x20;a&#x20;fixed&#x20;size&#x20;and&#x20;as&#x20;grayscale&#x20;images.&#x20;The&#x0D;&#x0A;object&#x20;is&#x20;then&#x20;compared&#x20;with&#x20;a&#x20;multidimensional&#x20;Gaussian&#x20;model&#x20;created&#x20;from&#x0D;&#x0A;a&#x20;group&#x20;of&#x20;fish&#x20;images.&#x20;Note&#x20;that&#x20;this&#x20;step&#x20;requires&#x20;human&#x20;intervention&#x20;but&#x20;is&#x0D;&#x0A;simpler&#x20;than&#x20;a&#x20;supervised&#x20;model&#x20;where&#x20;negative&#x20;images&#x20;are&#x20;also&#x20;required&#x20;to&#x20;be&#x0D;&#x0A;labelled.&#x20;Finally,&#x20;those&#x20;images&#x20;that&#x20;pass&#x20;this&#x20;test&#x20;are&#x20;grouped&#x20;with&#x20;a&#x20;Gaussian&#x0D;&#x0A;mixture&#x20;model&#x20;(GMM),&#x20;using&#x20;the&#x20;elbow&#x20;method&#x20;to&#x20;determine&#x20;the&#x20;total&#x20;number&#x0D;&#x0A;of&#x20;clusters&#x20;(species).&#x20;All&#x20;this&#x20;algorithm&#x20;has&#x20;been&#x20;developed&#x20;using&#x20;Python&#x20;with&#x0D;&#x0A;the&#x20;OpenCV&#x20;and&#x20;Keras&#x20;&#x2F;&#x20;TF&#x20;libraries.&#x0D;&#x0A;To&#x20;conclude,&#x20;it&#x20;can&#x20;be&#x20;stated&#x20;that&#x20;with&#x20;current&#x20;technology&#x20;and&#x20;using&#x20;the&#x0D;&#x0A;capabilities&#x20;of&#x20;the&#x20;Python&#x20;programming&#x20;language&#x20;and&#x20;the&#x20;available&#x20;artificial&#x0D;&#x0A;intelligence&#x20;libraries,&#x20;it&#x20;is&#x20;possible&#x20;to&#x20;create&#x20;a&#x20;semi-autonomous&#x20;system&#x20;that&#x20;can&#x0D;&#x0A;perform&#x20;an&#x20;analysis&#x20;of&#x20;the&#x20;information&#x20;contained&#x20;in&#x20;videos&#x20;or&#x20;images.&#x0D;&#x0A;Keyword:&#x20;Big&#x20;Data,&#x20;Artificial&#x20;Intelligence,&#x20;Python,&#x20;GMM,&#x20;Autoencoder,&#x20;OpenCV,&#x0D;&#x0A;Clustering,&#x20;Classification,&#x20;Supervised&#x20;learning
KW - Ingeniería&#x20;Telemática
KW - Inteligencia&#x20;artificial
KW - Artificial&#x20;intelligence
KW - Análisis&#x20;de&#x20;datos
KW - Data&#x20;analysis
KW - 1203.04&#x20;Inteligencia&#x20;Artificial
KW - 2209.90&#x20;Tratamiento&#x20;Digital.&#x20;Imágenes
LA - spa
ER -