User profiles for A. Tamae
Aviv Tamar, Technion. Verified email at technion.ac.il. Cited by 12274.
Multi-agent actor-critic for mixed cooperative-competitive environments
We explore deep reinforcement learning methods for multi-agent domains. We begin by
analyzing the difficulty of traditional algorithms in the multi-agent case: Q-learning is challenged …
Stress hormones increase cell proliferation and regulates interleukin-6 secretion in human oral squamous cell carcinoma cells
DG Bernabé, AC Tamae, ÉR Biasoli… - Brain, behavior, and …, 2011 - Elsevier
Patients with oral cancer can have high psychological distress levels, but the effects of stress-related
hormones on oral cancer cells and possible mechanisms underlying these …
Constrained policy optimization
For many applications of reinforcement learning it can be more convenient to specify both a
reward function and constraints, rather than trying to design behavior through the reward …
Cell‐type‐specific excitatory and inhibitory circuits involving primary afferents in the substantia gelatinosa of the rat spinal dorsal horn in vitro
…, MH Rashid, M Sonohata, A Tamae… - The Journal of …, 2007 - Wiley Online Library
The substantia gelatinosa (SG) of the spinal dorsal horn shows significant morphological
heterogeneity and receives primary afferent input predominantly from Aδ‐ and C‐fibres. …
Value iteration networks
We introduce the value iteration network (VIN): a fully differentiable neural network with
a 'planning module' embedded within. VINs can learn to plan, and are suitable for predicting …
Model-ensemble trust-region policy optimization
Model-free reinforcement learning (RL) methods are succeeding in a growing number of
tasks, aided by recent advances in deep learning. However, they tend to suffer from high …
Policy gradients with variance related risk criteria
Managing risk in dynamic decision problems is of cardinal importance in many fields such as
finance and process control. The most common approach to defining risk is through various …
Direct inhibition of substantia gelatinosa neurones in the rat spinal cord by activation of dopamine D2‐like receptors
A Tamae, T Nakatsuka, K Koga, G Kato… - The Journal of …, 2005 - Wiley Online Library
Dopaminergic innervation of the spinal cord is largely derived from the brain. To understand
the cellular mechanisms of antinociception mediated by descending dopaminergic pathways…
Learning to route
Recently, much attention has been devoted to the question of whether/when traditional network
protocol design, which relies on the application of algorithmic insights by human experts, …
Variance adjusted actor critic algorithms
We present an actor-critic framework for MDPs where the objective is the variance-adjusted
expected return. Our critic uses linear function approximation, and we extend the concept of …