Текущий выпуск Номер 5, 2024 Том 16

Все выпуски

2024 Том 16
- Номер 5
- Номер 4
- Номер 3
- Номер 2
- Номер 1 (специальный выпуск)
2023 Том 15
- Номер 6
- Номер 5
- Номер 4 (специальный выпуск)
- Номер 3
- Номер 2 (специальный выпуск)
- Номер 1
2022 Том 14
- Номер 6
- Номер 5
- Номер 4 (специальный выпуск)
- Номер 3
- Номер 2 (специальный выпуск)
- Номер 1
2021 Том 13
- Номер 6
- Номер 5
- Номер 4
- Номер 3
- Номер 2 (специальный выпуск)
- Номер 1
2020 Том 12
2019 Том 11
2018 Том 10
- Номер 6
- Номер 5 (специальный выпуск)
- Номер 4
- Номер 3 (специальный выпуск)
- Номер 2
- Номер 1
2017 Том 9
2016 Том 8
2015 Том 7
- Номер 6
- Номер 5
- Номер 4
- Номер 3 (специальный выпуск)
- Номер 2
- Номер 1
2014 Том 6
- Номер 6 (специальный выпуск)
- Номер 5
- Номер 4
- Номер 3
- Номер 2
- Номер 1
2013 Том 5
- Номер 6 (специальный выпуск)
- Номер 5
- Номер 4
- Номер 3
- Номер 2
- Номер 1
2012 Том 4
2011 Том 3
2010 Том 2
2009 Том 1

Результаты поиска по 'probabilistic modeling':

Найдено статей: 22

Гаврилов С.В., Матюшкин И.В.
Статистический анализ блочно-поворотного механизма Марголуса в клеточно-автоматной модели диффузии в среде с дискретными особенностями
Компьютерные исследования и моделирование, 2015, т. 7, № 6, с. 1155-1175

Предложено обобщение блочного клеточного автомата Марголуса на гексагональную сетку. Проведена статистическая обработка результатов вероятностных клеточно-автоматных вычислений для ряда модификаций схемы, решающей тестовую задачу диффузии вещества. Показано, что выбор блоков в виде гексагонов на 25% эффективнее, чем в виде Y-блоков. Показано, что алгоритмы имеют полиномиальную сложность, причем степень полинома для параллельных вычислителей лежит в пределах 0.6÷0.8, а для последовательных — в пределах 1.5÷1.7. Исследовалось влияние внедренных в поле клеточного автомата дефектных ячеек на скорость сходимости.

Ключевые слова: диффузия, метод моделирования, дискретные особенности, блочные клеточные автоматы, окрестность Марголуса, гексагональная сетка.

Gavrilov S.V., Matyushkin I.V.
Statistical analysis of Margolus’s block-rotating mechanism cellular automation modeling the diffusion in a medium with discrete singularities
Computer Research and Modeling, 2015, v. 7, no. 6, pp. 1155-1175

The generalization of Margolus’s block cellular automaton on a hexagonal grid is formulated. Statistical analysis of the results of probabilistic cellular automation for vast variety of this scheme solving the test task of diffusion is done. It is shown that the choice of the hexagon blocks is 25% more efficient than Y-blocks. It is shown that the algorithms have polynomial complexity, and the polynom degree lies within 0.6÷0.8 for parallel computer, and in the range 1.5÷1.7 for serial computer. The effects of embedded into automaton’s field defective cells on the rate of convergence are studied also.

Keywords: diffusion, method of modeling, discrete singularities, block cellular automata, Margolus neighborhood, hexagonal grid.
Просмотров за год: 8. Цитирований: 4 (РИНЦ).
Гогуев М.В., Кислицын А.А.
Моделирование траекторий временных рядов с помощью уравнения Лиувилля
Компьютерные исследования и моделирование, 2024, т. 16, № 3, с. 585-598

Представлен алгоритм моделирования ансамбля траекторий нестационарных временных рядов. Построена численная схема аппроксимации выборочной плотности функции распределения в задаче с закрепленными концами, когда начальное распределение за заданное количество шагов переходит в определенное конечное распределение, так, что на каждом шаге выполняется полугрупповое свойство решения уравнения Лиувилля. Модель позволяет численно построить эволюционирующие плотности функций распределения при случайном переключении состояний системы, порождающей исходный временной ряд.

Основная проблема, рассматриваемая в работе, связана с тем, что при численной реализации левосторонней разностной производной по времени решение становится неустойчивым, но именно такой подход отвечает моделированию эволюции. При выборе неявных устойчивых схем с «заходом в будущее» используется итерационный процесс, который на каждом своем шаге не отвечает полугрупповому свойству. Если же моделируется некоторый реальный процесс, в котором предположительно имеет место целеполагание, то желательно использовать схемы, которые порождают модель переходного процесса. Такая модель используется в дальнейшем для того, чтобы построить предиктор разладки, который позволит определить, в какое именно состояние переходит изучаемый процесс до того, как он действительно в него перешел. Описываемая в статье модель может использоваться как инструментарий моделирования реальных нестационарных временных рядов.

Схема моделирования состоит в следующем. Из заданного временного ряда отбираются фрагменты, отвечающие определенным состояниям, например трендам с заданными углами наклона и дисперсиями. Из этих фрагментов составляются эталонные распределения состояний. Затем определяются эмпирические распределения длительностей пребывания системы в указанных состояниях и длительности времени перехода из состояния в состояние. В соответствии с этими эмпирическими распределениями строится вероятностная модель разладки и моделируются соответствующие траектории временного ряда.

Ключевые слова: нестационарный временной ряд, выборочная функция распределения, аппроксимация скорости, кинетическое уравнение, полугруппа.

Goguev M.V., Kislitsyn A.A.
Modeling time series trajectories using the Liouville equation
Computer Research and Modeling, 2024, v. 16, no. 3, pp. 585-598

This paper presents algorithm for modeling set of trajectories of non-stationary time series, based on a numerical scheme for approximating the sample density of the distribution function in a problem with fixed ends, when the initial distribution for a given number of steps transforms into a certain final distribution, so that at each step the semigroup property of solving the Liouville equation is satisfied. The model makes it possible to numerically construct evolving densities of distribution functions during random switching of states of the system generating the original time series.

The main problem is related to the fact that with the numerical implementation of the left-hand differential derivative in time, the solution becomes unstable, but such approach corresponds to the modeling of evolution. An integrative approach is used while choosing implicit stable schemes with “going into the future”, this does not match the semigroup property at each step. If, on the other hand, some real process is being modeled, in which goal-setting presumably takes place, then it is desirable to use schemes that generate a model of the transition process. Such model is used in the future in order to build a predictor of the disorder, which will allow you to determine exactly what state the process under study is going into, before the process really went into it. The model described in the article can be used as a tool for modeling real non-stationary time series.

Steps of the modeling scheme are described further. Fragments corresponding to certain states are selected from a given time series, for example, trends with specified slope angles and variances. Reference distributions of states are compiled from these fragments. Then the empirical distributions of the duration of the system’s stay in the specified states and the duration of the transition time from state to state are determined. In accordance with these empirical distributions, a probabilistic model of the disorder is constructed and the corresponding trajectories of the time series are modeled.

Keywords: nonstationary time series, sample distribution function, velocity approximation, kinetic equation, semigroup.
Воронцов К.В., Потапенко А.А.
Регуляризация, робастность и разреженность вероятностных тематических моделей
Компьютерные исследования и моделирование, 2012, т. 4, № 4, с. 693-706

Предлагается обобщенное семейство вероятностных тематических моделей коллекций текстовых документов, в котором эвристики регуляризации, сэмплирования, частого обновления параметров, робастности относительно шума и фона могут включаться независимо друг от друга в любых сочетаниях, порождая как известные модели PLSA, LDA, CVB0, SWB, так и новые. Показано, что робастная тематическая модель на основе PLSA, разделяющая термины на тематические, шумовые и фоновые, не нуждается в регуляризации и обеспечивает разреженность искомых дискретных распределений тем в документах и терминов в темах.

Ключевые слова: компьютерныйана лиз текстов, тематическое моделирование, вероятностныйла тентный семантическийана лиз, EM-алгоритм, латентное размещение Дирихле, сэмплирование Гиббса, байесовская регуляризация, перплексия, робастность.

Vorontsov K.V., Potapenko A.A.
Regularization, robustness and sparsity of probabilistic topic models
Computer Research and Modeling, 2012, v. 4, no. 4, pp. 693-706

We propose a generalized probabilistic topic model of text corpora which can incorporate heuristics of Bayesian regularization, sampling, frequent parameters update, and robustness in any combinations. Wellknown models PLSA, LDA, CVB0, SWB, and many others can be considered as special cases of the proposed broad family of models. We propose the robust PLSA model and show that it is more sparse and performs better that regularized models like LDA.

Keywords: text analysis, topic modeling, probabilistic latent semantic analysis, EM-algorithm, latent Dirichlet allocation, Gibbs sampling, Bayesian regularization, perplexity, robusteness.
Просмотров за год: 25. Цитирований: 12 (РИНЦ).
Жихаревич В.В., Шумиляк Л.М.
Аппроксимация решения нестационарного уравнения теплопроводности методом вероятностных непрерывных асинхронных клеточных автоматов для одномерного случая
Компьютерные исследования и моделирование, 2012, т. 4, № 2, с. 293-301

В статье рассматривается решение задач теплопроводности с помощью метода непрерывных асинхронных клеточных автоматов. Продемонстрировано согласование распределения температуры в образце между клеточно-автоматной моделью и точным аналитическим решением уравнения теплопереноса в определенный момент времени, что говорит о целесообразном использовании данного метода моделирования. Получена зависимость между временем одного клеточно-автоматного взаимодействия и размерностью клеточно-автоматного поля.

Ключевые слова: уравнение теплопроводности, клеточный автомат, время взаимодействия.

Zhуkharevуch V.V., Shumуlyak L.M.
Approximation of the solution of the non-stationary equation of heat conductivity by the method of probabilistic continuous asynchronous cellular automats for a one-dimensional case
Computer Research and Modeling, 2012, v. 4, no. 2, pp. 293-301

The solution of problems of heat conductivity by means of a method of continuous asynchronous cellular automats is considered in the article. Coordination of distribution of temperature in a sample at a given time between cellular automat model and the exact analytical solution of the equation of heattransfer is shown that speaks about expedient use of this method of modelling. Dependence between time of one cellular automatic interaction and dimension of a cellular automatic field is received.

Keywords: heat conductivity equation, cellular automat, time of interaction.
Просмотров за год: 10. Цитирований: 4 (РИНЦ).
Башкирцева И.А., Бояршинова П.В., Рязанова Т.В., Ряшко Л.Б.
Анализ индуцированного шумом разрушения режимов сосуществования в популяционной системе «хищник–жертва»
Компьютерные исследования и моделирование, 2016, т. 8, № 4, с. 647-660

Работа посвящена проблеме анализа близости популяционной системы к опасным границам, при пересечении которых в системе разрушается устойчивое сосуществование взаимодействующих популяций. В качестве причины такого разрушения рассматриваются случайные возмущения, неизбежно присутствующие в любой живой системе. Это исследование проводится на примере известной модели взаимодействия популяций хищника и жертвы, учитывающей как стабилизирующий фактор конкуренции хищника за отличные от жертвы ресурсы, так и дестабилизирующий фактор насыщения хищника. Для описания насыщения хищника используется трофическая функция Холлинга второго типа. Динамика системы исследуется в зависимости от коэффициента, характеризующего насыщение хищника, и коэффициента конкуренции хищника за отличные от жертвы ресурсы. В работе дается параметрическое описание возможных режимов динамики детерминированной модели, исследуются локальные и глобальные бифуркации и выделяются зоны устойчивого сосуществования популяций в равновесном и осцилляционном режимах. Интересной математической особенностью данной модели, впервые рассмотренной Базыкиным, является глобальная бифуркация рождения цикла из петли сепаратрисы. В работе исследуется воздействие шума на равновесный и осцилляционный режимы сосуществования популяций хищника и жертвы. Показано, что увеличение интенсивности случайных возмущений может привести к значительным деформациям этих режимов вплоть до их разрушения. Целью данной работы является разработка конструктивного вероятностного критерия близости этой стохастической системы к опасным границам. Основой предлагаемого математического подхода является техника функций стохастической чувствительности и метод доверительных областей — доверительных эллипсов, окружающих устойчивое равновесие, и доверительных полос вокруг устойчивого цикла. Размеры доверительных областей пропорциональны интенсивности шума и стохастической чувствительности исходных детерминированных аттракторов. Геометрическим критерием выхода популяционной системы из режима устойчивого сосуществования является пересечение доверительных областей и соответствующих сепаратрис детерминированной модели. Эффективность данного аналитического подхода подтверждается хорошим соответствием теоретических оценок и результатов прямого численного моделирования.

Ключевые слова: популяционная динамика, случайные возмущения, функция стохастической чувствительности, доверительные области.

Bashkirtseva I.A., Boyarshinova P.V., Ryazanova T.V., Ryashko L.B.
Analysis of noise-induced destruction of coexistence regimes in «prey–predator» population model
Computer Research and Modeling, 2016, v. 8, no. 4, pp. 647-660

The paper is devoted to the analysis of the proximity of the population system to dangerous boundaries. An intersection of these boundaries results in the collapse of the stable coexistence of interacting populations. As a reason of such destruction one can consider random perturbations inevitably presented in any living system. This study is carried out on the example of the well-known model of interaction between predator and prey populations, taking into account both a stabilizing factor of the competition of predators for another than prey resources, and also a destabilizing saturation factor for predators. To describe the saturation of predators, we use the second type Holling trophic function. The dynamics of the system is studied as a function of the predator saturation, and the coefficient of predator competition for resources other than prey. The paper presents a parametric description of the possible dynamic regimes of the deterministic model. Here, local and global bifurcations are studied, and areas of sustainable coexistence of populations in equilibrium and the oscillation modes are described. An interesting feature of this mathematical model, firstly considered by Bazykin, is a global bifurcation of the birth of limit cycle from the separatrix loop. We study the effects of noise on the equilibrium and oscillatory regimes of coexistence of predator and prey populations. It is shown that an increase of the intensity of random disturbances can lead to significant deformations of these regimes right up to their destruction. The aim of this work is to develop a constructive probabilistic criterion for the proximity of the population stochastic system to the dangerous boundaries. The proposed approach is based on the mathematical technique of stochastic sensitivity functions, and the method of confidence domains. In the case of a stable equilibrium, this confidence domain is an ellipse. For the stable cycle, this domain is a confidence band. The size of the confidence domain is proportional to the intensity of the noise and stochastic sensitivity of the initial deterministic attractor. A geometric criterion of the exit of the population system from sustainable coexistence mode is the intersection of the confidence domain and the corresponding separatrix of the unforced deterministic model. An effectiveness of this analytical approach is confirmed by the good agreement of theoretical estimates and results of direct numerical simulations.

Keywords: population dynamics, random disturbances, stochastic sensitivity function, confidence domains.
Просмотров за год: 14. Цитирований: 4 (РИНЦ).
Богомолов С.В.
Стохастическая формализация газодинамической иерархии
Компьютерные исследования и моделирование, 2022, т. 14, № 4, с. 767-779

Математические модели газовой динамики и ее вычислительная индустрия, на наш взгляд, далеки от совершенства. Мы посмотрим на эту проблематику с точки зрения ясной вероятностной микромодели газа из твердых сфер, опираясь как на теорию случайных процессов, так и на классическую кинетическую теорию в терминах плотностей функций распределения в фазовом пространстве; а именно, построим сначала систему нелинейных стохастических дифференциальных уравнений (СДУ), а затем обобщенное случайное и неслучайное интегро-дифференциальное уравнение Больцмана с учетом корреляций и флуктуаций. Ключевыми особенностями исходной модели являются случайный характер интенсивности скачкообразной меры и ее зависимость от самого процесса.

Кратко напомним переход ко все более грубым мезо-макроприближениям в соответствии с уменьшением параметра обезразмеривания, числа Кнудсена. Получим стохастические и неслучайные уравнения, сначала в фазовом пространстве (мезомодель в терминах СДУ по винеров- ским мерам и уравнения Колмогорова – Фоккера – Планка), а затем в координатном пространстве (макроуравнения, отличающиеся от системы уравнений Навье – Стокса и систем квазигазодинамики). Главным отличием этого вывода является более точное осреднение по скорости благодаря аналитическому решению стохастических дифференциальных уравнений по винеровской мере, в виде которых представлена промежуточная мезомодель в фазовом пространстве. Такой подход существенно отличается от традиционного, использующего не сам случайный процесс, а его функцию распределения. Акцент ставится на прозрачности допущений при переходе от одного уровня детализации к другому, а не на численных экспериментах, в которых содержатся дополнительные погрешности аппроксимации.

Теоретическая мощь микроскопического представления макроскопических явлений важна и как идейная опора методов частиц, альтернативных разностным и конечно-элементным.

Ключевые слова: уравнение Больцмана, уравнение Колмогорова – Фоккера – Планка, уравнение Навье – Стокса, уравнения стохастической газодинамики и квазигазодинамики, стохастические дифференциальные уравнения по бернуллиевой и винеровской мерам, методы частиц.

Bogomolov S.V.
Stochastic formalization of the gas dynamic hierarchy
Computer Research and Modeling, 2022, v. 14, no. 4, pp. 767-779

Mathematical models of gas dynamics and its computational industry, in our opinion, are far from perfect. We will look at this problem from the point of view of a clear probabilistic micro-model of a gas from hard spheres, relying on both the theory of random processes and the classical kinetic theory in terms of densities of distribution functions in phase space, namely, we will first construct a system of nonlinear stochastic differential equations (SDE), and then a generalized random and nonrandom integro-differential Boltzmann equation taking into account correlations and fluctuations. The key feature of the initial model is the random nature of the intensity of the jump measure and its dependence on the process itself.

Briefly recall the transition to increasingly coarse meso-macro approximations in accordance with a decrease in the dimensionalization parameter, the Knudsen number. We obtain stochastic and non-random equations, first in phase space (meso-model in terms of the Wiener — measure SDE and the Kolmogorov – Fokker – Planck equations), and then — in coordinate space (macro-equations that differ from the Navier – Stokes system of equations and quasi-gas dynamics systems). The main difference of this derivation is a more accurate averaging by velocity due to the analytical solution of stochastic differential equations with respect to the Wiener measure, in the form of which an intermediate meso-model in phase space is presented. This approach differs significantly from the traditional one, which uses not the random process itself, but its distribution function. The emphasis is placed on the transparency of assumptions during the transition from one level of detail to another, and not on numerical experiments, which contain additional approximation errors.

The theoretical power of the microscopic representation of macroscopic phenomena is also important as an ideological support for particle methods alternative to difference and finite element methods.

Keywords: Boltzmann equation, Kolmogorov – Fokker – Planck equation, Navier – Stokes equations, equations of stochastic gas dynamics and quasi-gas dynamics, stochastic differential equations with respect to Bernulli and Wiener measures, particle methods.
Лубашевский И.А., Лубашевский В.И.
Модель динамической ловушки для описания человеческого контроля в рамках «стимул – реакция»
Компьютерные исследования и моделирование, 2024, т. 16, № 1, с. 79-87

В статье предлагается новая модель динамической ловушки типа «стимул – реакция», которая имитирует человеческий контроль динамических систем, где ограниченная рациональность человеческого сознания играет существенную роль. Детально рассматривается сценарий, в котором субъект модулирует контролируемую переменную в ответ на определенный стимул. В этом контексте ограниченная рациональность человеческого сознания проявляется в неопределенности восприятия стимула и последующих действий субъекта. Модель предполагает, что когда интенсивность стимула падает ниже (размытого) порога восприятия стимула, субъект приостанавливает управление и поддерживает контролируемую переменную вблизи нуля с точностью, определяемую неопределенностью ее управления. Когда интенсивность стимула превышает неопределенность восприятия и становится доступной человеческому сознания, испытуемый активирует контроль. Тем самым, динамику системы можно представить как чередующуюся последовательность пассивного и активного режимов управления с вероятностными переходами между ними. Более того, ожидается, что эти переходы проявляют гистерезис из-за инерции принятия решений.

В общем случае пассивный и активный режимы базируются на различных механизмах, что является проблемой для создания эффективных алгоритмов их численного моделирования. Предлагаемая модель преодолевает эту проблему за счет введения динамической ловушки типа «стимул – реакция», имеющей сложную структуру. Область динамической ловушки включает две подобласти: область стагнации динамики системы и область гистерезиса. Модель основывается на формализме стохастических дифференциальных уравнений и описывает как вероятностные переходы между пассивным и активным режимами управления, так и внутреннюю динамику этих режимов в рамках единого представления. Предложенная модель воспроизводит ожидаемые свойства этих режимов управления, вероятностные переходы между ними и гистерезис вблизи порога восприятия. Кроме того, в предельном случае модель оказывается способной имитировать человеческий контроль, когда (1) активный режим представляет собой реализацию «разомкнутого» типа для локально запланированных действий и (2) активация контроля возникает только тогда, когда интенсивность стимула существенно возрастает и риск потери контроля системы становится существенным.

Ключевые слова: человеческий контроль, прерывистость, неопределенность, гистерезис, случайные процессы, стохастические дифференциальные уравнения.

Lubashevsky I.A., Lubashevskiy V.I.
Dynamical trap model for stimulus – response dynamics of human control
Computer Research and Modeling, 2024, v. 16, no. 1, pp. 79-87

We present a novel model for the dynamical trap of the stimulus – response type that mimics human control over dynamic systems when the bounded capacity of human cognition is a crucial factor. Our focus lies on scenarios where the subject modulates a control variable in response to a certain stimulus. In this context, the bounded capacity of human cognition manifests in the uncertainty of stimulus perception and the subsequent actions of the subject. The model suggests that when the stimulus intensity falls below the (blurred) threshold of stimulus perception, the subject suspends the control and maintains the control variable near zero with accuracy determined by the control uncertainty. As the stimulus intensity grows above the perception uncertainty and becomes accessible to human cognition, the subject activates control. Consequently, the system dynamics can be conceptualized as an alternating sequence of passive and active modes of control with probabilistic transitions between them. Moreover, these transitions are expected to display hysteresis due to decision-making inertia.

Generally, the passive and active modes of human control are governed by different mechanisms, posing challenges in developing efficient algorithms for their description and numerical simulation. The proposed model overcomes this problem by introducing the dynamical trap of the stimulus-response type, which has a complex structure. The dynamical trap region includes two subregions: the stagnation region and the hysteresis region. The model is based on the formalism of stochastic differential equations, capturing both probabilistic transitions between control suspension and activation as well as the internal dynamics of these modes within a unified framework. It reproduces the expected properties in control suspension and activation, probabilistic transitions between them, and hysteresis near the perception threshold. Additionally, in a limiting case, the model demonstrates the capability of mimicking a similar subject’s behavior when (1) the active mode represents an open-loop implementation of locally planned actions and (2) the control activation occurs only when the stimulus intensity grows substantially and the risk of the subject losing the control over the system dynamics becomes essential.

Keywords: human control, intermittency, uncertainty, hysteresis, stochastic process, stochastic differential equations.
Башкирцева И.А., Екатеринчук Е.Д., Рязанова Т.В., Сысолятина А.А.
Математическое моделирование стохастических равновесий и бизнес-циклов модели Гудвина
Компьютерные исследования и моделирование, 2013, т. 5, № 1, с. 107-118

В работе рассматривается модель экономической динамики Гудвина, находящаяся под воздействием случайных возмущений. Проведен полный параметрический анализ равновесий и циклов детерминированной системы. Исследованы вероятностные свойства аттракторов стохастической системы с использованием техники функций стохастической чувствительности и метода прямого численного моделирования. Обсуждается явление генерации стохастических бизнес-циклов в зоне, где исходная детерминированная модель имеет лишь устойчивые равновесия.

Ключевые слова: модель Гудвина, бизнес циклы, случайные возмущения, функция стохастической чувствительности, индуцированные шумом переходы.

Bashkirtseva I.A., Ekaterinchuk E.D., Ryazanova T.V., Sysolyatina A.A.
Mathematical modeling of stochastic equilibria and business cycles of Goodwin model
Computer Research and Modeling, 2013, v. 5, no. 1, pp. 107-118

The Goodwin dynamical model under the random external disturbances is considered. A full parametrical analysis for equlibria and cycles of deterministic model is developed. We study probabilistic properties of stochastic attractors using stochastic sensitivity functions technique and numerical methods. A phenomenon of the generation of stochastic business cycles in the zones of stable equilibria is discussed.

Keywords: Goodwin’s model, business cycle, random pertubation, stochastic sensitivity function, noise-induced transitions.
Просмотров за год: 5. Цитирований: 4 (РИНЦ).
Pham C.T., Phan M.N., Tran T.T.
Image classification based on deep learning with automatic relevance determination and structured Bayesian pruning
Компьютерные исследования и моделирование, 2024, т. 16, № 4, с. 927-938

Deep learning’s power stems from complex architectures; however, these can lead to overfitting, where models memorize training data and fail to generalize to unseen examples. This paper proposes a novel probabilistic approach to mitigate this issue. We introduce two key elements: Truncated Log-Uniform Prior and Truncated Log-Normal Variational Approximation, and Automatic Relevance Determination (ARD) with Bayesian Deep Neural Networks (BDNNs). Within the probabilistic framework, we employ a specially designed truncated log-uniform prior for noise. This prior acts as a regularizer, guiding the learning process towards simpler solutions and reducing overfitting. Additionally, a truncated log-normal variational approximation is used for efficient handling of the complex probability distributions inherent in deep learning models. ARD automatically identifies and removes irrelevant features or weights within a model. By integrating ARD with BDNNs, where weights have a probability distribution, we achieve a variational bound similar to the popular variational dropout technique. Dropout randomly drops neurons during training, encouraging the model not to rely heavily on any single feature. Our approach with ARD achieves similar benefits without the randomness of dropout, potentially leading to more stable training.

To evaluate our approach, we have tested the model on two datasets: the Canadian Institute For Advanced Research (CIFAR-10) for image classification and a dataset of Macroscopic Images of Wood, which is compiled from multiple macroscopic images of wood datasets. Our method is applied to established architectures like Visual Geometry Group (VGG) and Residual Network (ResNet). The results demonstrate significant improvements. The model reduced overfitting while maintaining, or even improving, the accuracy of the network’s predictions on classification tasks. This validates the effectiveness of our approach in enhancing the performance and generalization capabilities of deep learning models.

Ключевые слова: automatic relevance determination, Bayesian deep neural networks, truncated lognormal variational approximation, macroscopic image.

Pham C.T., Phan M.N., Tran T.T.
Image classification based on deep learning with automatic relevance determination and structured Bayesian pruning
Computer Research and Modeling, 2024, v. 16, no. 4, pp. 927-938

Deep learning’s power stems from complex architectures; however, these can lead to overfitting, where models memorize training data and fail to generalize to unseen examples. This paper proposes a novel probabilistic approach to mitigate this issue. We introduce two key elements: Truncated Log-Uniform Prior and Truncated Log-Normal Variational Approximation, and Automatic Relevance Determination (ARD) with Bayesian Deep Neural Networks (BDNNs). Within the probabilistic framework, we employ a specially designed truncated log-uniform prior for noise. This prior acts as a regularizer, guiding the learning process towards simpler solutions and reducing overfitting. Additionally, a truncated log-normal variational approximation is used for efficient handling of the complex probability distributions inherent in deep learning models. ARD automatically identifies and removes irrelevant features or weights within a model. By integrating ARD with BDNNs, where weights have a probability distribution, we achieve a variational bound similar to the popular variational dropout technique. Dropout randomly drops neurons during training, encouraging the model not to rely heavily on any single feature. Our approach with ARD achieves similar benefits without the randomness of dropout, potentially leading to more stable training.

To evaluate our approach, we have tested the model on two datasets: the Canadian Institute For Advanced Research (CIFAR-10) for image classification and a dataset of Macroscopic Images of Wood, which is compiled from multiple macroscopic images of wood datasets. Our method is applied to established architectures like Visual Geometry Group (VGG) and Residual Network (ResNet). The results demonstrate significant improvements. The model reduced overfitting while maintaining, or even improving, the accuracy of the network’s predictions on classification tasks. This validates the effectiveness of our approach in enhancing the performance and generalization capabilities of deep learning models.

Keywords: automatic relevance determination, Bayesian deep neural networks, truncated lognormal variational approximation, macroscopic image.
Давыдов Д.В., Шаповал А.Б., Ямилов А.И.
Распространение языков в КНР на уровне провинций: оценивание при неполных данных
Компьютерные исследования и моделирование, 2016, т. 8, № 4, с. 707-716

Данная работа посвящена решению практической задачи восстановления данных по распространению языков на региональном уровне на примере Китайской Народной Республики. Необходимость получения таких данных связана с задачей вычисления индексов лингвистического разнообразия, которые, в свою очередь, активно используются при эмпирическом анализе и прогнозе факторов социально-экономического развития, а также могут служить индикаторами потенциальных конфликтов на рассматриваемых территориях. В качестве исходной информации мы используем сведения из базы данных «Этнолог» (Ethnologue), дополняя их общедоступными данными переписей населения. Рассматриваемые нами данные содержат по каждому языку (а) оценку количества жителей страны, считающих этот язык родным, и (б) индикаторы наличия таких жителей в каждой из провинций КНР. Наша задача — для всех пар «язык–провинция» оценить количество жителей провинции, считающих этот язык родным. Она сводится к решению недоопределенной системы алгебраических уравнений. Специфика данных Ethnologue заключается в том, что, в силу большой трудоемкости и стоимости сбора таких данных, а также неполноты сведений по соответствующему разделу в переписях населения, имеющаяся информация по отдельным языкам в различных провинциях представлена за различные периоды времени. Одновременное использование таких данных приводит к тому, что возникающая система уравнений имеет неточно определенную правую часть, поэтому мы строим приближенное решение, характеризуемое минимальной невязкой. Учитывая неоднородность исходных данных (некоторые из языков оказываются на порядки менее распространенными), мы переходим к использованию взвешенной невязки, определяя в каждом уравнении весовые коэффициенты как величины, обратно пропорциональные правой части. Такой способ формирования невязки позволяет восстановить искомые переменные. Более 92% переменных оказываются устойчивыми к изменениям правой части при вероятностном моделировании ошибок записей в исходных данных.

Ключевые слова: использование языков в регионах, индексы неоднородности, восстановление неполных данных.

Davydov D.V., Shapoval A.B., Yamilov A.I.
Languages in China provinces: quantitative estimation with incomplete data
Computer Research and Modeling, 2016, v. 8, no. 4, pp. 707-716

This paper formulates and solves a practical problem of data recovery regarding the distribution of languages on regional level in context of China. The necessity of this recovery is related to the problem of the determination of the linguistic diversity indices, which, in turn, are used to analyze empirically and to predict sources of social and economic development as well as to indicate potential conflicts at regional level. We use Ethnologue database and China census as the initial data sources. For every language spoken in China, the data contains (a) an estimate of China residents who claim this language to be their mother tongue, and (b) indicators of the presence of such residents in China provinces. For each pair language/province, we aim to estimate the number of the province inhabitants that claim the language to be their mother tongue. This base problem is reduced to solving an undetermined system of algebraic equations. Given additional restriction that Ethnologue database introduces data collected at different time moments because of gaps in Ethnologue language surveys and accompanying data collection expenses, we relate those data to a single time moment, that turns the initial task to an ’ill-posed’ system of algebraic equations with imprecisely determined right hand side. Therefore, we are looking for an approximate solution characterized by a minimal discrepancy of the system. Since some languages are much less distributed than the others, we minimize the weighted discrepancy, introducing weights that are inverse to the right hand side elements of the equations. This definition of discrepancy allows to recover the required variables. More than 92% of the recovered variables are robust to probabilistic modelling procedure for potential errors in initial data.

Keywords: regional languages usage, dissimilarity indices, incomplete data identification.
Просмотров за год: 3.

Страницы: следующая последняя »

Журнал индексируется в Scopus

Полнотекстовая версия журнала доступна также на сайте научной электронной библиотеки eLIBRARY.RU

Журнал входит в систему Российского индекса научного цитирования.

Журнал включен в базу данных Russian Science Citation Index (RSCI) на платформе Web of Science

Международная Междисциплинарная Конференция "Математика. Компьютер. Образование"